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SOME METHODOLOGICAL NOTES ON THE 
DEFLATION OF CONSTRUCTION* 


NorMAN M. Kaptan 
RAND Corporation 


Because of the difficulties in finding homogeneous final outputs, the 
procedure in general use for the deflation of construction involves the 
pricing of inputs rather than outputs and consequently ignores changes 
in productivity. The present paper proposes: (a) a re-definition of 
output in terms of intermediate products; and (b) because of the in- 
applicability of a direct method, two indirect methods for obtaining a 
measure of the change in the average price of intermediate products. 
One of the two methods combines an index of input prices and a meas- 
ure of productivity changes. the latter of which may be of some inde- 
pendent interest. 


1, INTRODUCTION 


N INTERTEMPORAL or international comparisons of national product, con- 
I struction is a particularly troublesome sector. The final output of the con- 
struction industry is almost completely heterogeneous, is largely produced to 
order rather than for stock, and is frequently priced on a cost-plus basis. Thus, 
the construction industry exhibits in extreme form virtually all the problems of 
measuring changes or differences in real output which appear, though less pre- 
dominantly, in the rest of the economy. Similarly, it is difficult to define and 
measure appropriate price relatives as elements of a price index number to 
deflate—i.e., to compare in constant or common prices—the values of con- 
struction in current or native prices. 

If we survey even superficially the procedures followed in the deflation of 
construction, it is evident that these difficulties have led to various kinds of 
compromises. In most cases, these compromises have taken the form of pricing 
inputs rather than outputs. In all cases, the descriptions of procedures or the 
estimates of the reliability of the results indicate appreciation of these difficul- 
ties and dissatisfaction, to some degree, with the resulting compromises. 

The purpose of this paper is twofold: (1) to describe and comment on some 
of the procedures employed in the deflation of construction; and (2) to suggest 
alternative approaches to the problem. The survey of procedures is unsys- 
tematic and cursory: the procedures described are those which I already know 
or could readily ascertain and, for present purposes, only the general nature 


* I am indebted to Richard Moorsteen of the RAND Corporation and Raymond P. Powell of Yale University 
for helpful comments on this paper and discussions of the underlying issues. 
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of these procedures are relevant. The alternative approaches are tentative sug- 
gestions. Their applicability and importance are, in large part, empirical 
questions which have not been explored. The suggestions, therefore, are essen- 
tially suggestions for further study. 


2. SOME PROCEDURES USED IN THE DEFLATION OF CONSTRUCTION 


Three instances of intertemporal deflation of construction and two of in- 
‘ternational deflation have been examined in connection with the present paper. 
In the three instances of intertemporal deflation, the procedures used are, or 
are reducible to, the pricing of inputs rather than outputs. The British Central 
Statistical Office uses a straightforward index of building materials’ prices 
and labor earnings per man-hour in order to obtain the value of construction 
in 1948 prices from the value of construction in current prices.' To obtain an 
index of the volume of Soviet construction in 1937 rubles, Raymond Powell 
moves the 1937 value of construction (in 1937 prices) backward and forward 
in time by the use of a 1937-price-weighted index of the volume of materials 
used in construction*—another instance of an approach which deals with inputs 
rather than outputs.’ > 

The US Department of Commerce uses a variety of price indexes—none of 
its own making—to deflate the various sub-aggregates of total construction in 
current prices.‘ For some sub-aggregates, the price indexes are straight- 
forward indexes of wages and materials prices—e.g., the Engineering News- 
Record Indexes. For others, the price indexes refer to types of buildings or to 
building components such as wiring—e.g., the E. H. Boeckh and Associates 
Index and the George A. Fuller Company Index. In most of these latter cases, 
the price indexes do not refer to the change in the selling price of a particular 
type of building or in the cost of a particular building component; they refer 
rather to the change in price of the fixed list of materials and labor involved in 
the building or building component. In a few cases, however, the quantities 
of materials and labor involved in the building or building component are not 
held constant over time—e.g., the Bureau of Public Roads Highway Index and 


1 Central Statistical Office, National Income Statistics Sources and Methods, London: Her Majesty's Stationery 
Office, 1956, p. 300. 

2 Raymond P. Powell, A Materials-Input Index of Soviet Construction, 1927/28 to 1955, Part I, The RAND 
Corporation RM-1872, February 14, 1957; and Raymond P. Powell, A. Materials-Input Index of Soviet Construction, 
1927/28 to 1955, Part II—Appendices, The RAND Corporation RM-1873, February 14, 1957. A summary of the 
results of this study will appear in Review of Economics and Statistics, May, 1959. 

* Whether one deflates with an input price index number or uses a volume of inputs index number directly as 
an index of the volume of construction is a side-issue for purposes of my paper, however important it may be from 
a statistical point of view; the two approaches will nct necessarily yield the same results unless the inputs included 
account for a stable proportion of the total value of construction at current prices. 

On the other hand, many of the issues raised in my paper are also discussed by Powell. For example, after a 
discussion of US index numbers and productivity changes, Powell justifies the omission of labor from his index on 
the following grounds. “In view of the evident possibilities for input substitutions, it is reasonable to suppose that 
the long-run effccts of mechanization on the productivities of labor and materials, absolute as well as relative, have 
been much the same in Soviet construction as in the US: an increase in the productivity of labor and a near-con- 
stancy in the productivity of materials.” (Powell, A Materiais-Input Index of Soviet Construciion, 1927/28 to 1956, 
Part I, 67). If this has been the historical course of productivity change, then Powell's index is preferable to any 
of the indexes I will suggest for it is accurate and requires less information. 

4 For the basis of these and subsequent remarks about US procedures see: United States Department of 
Labor and Department of Commerce, Construction Volume and Costs 1915-1954, A Statistical Supplement to Con- 
struction Review, Washington: Government Printing Office, 72-6; and United States Department of Commerce, 
National Income 1954 Edition, A Supplement to the Survey of Current Business, Washington: Government Printing 
Office, 1954, 156 and 157. 
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TABLE 537. THE RELATIONSHIP BETWEEN THE DEPARTMENT OF 
COMMERCE COMPOSITE INDEX AND AN INPUT PRICE INDEX 


Department of Department of 
Input Price Commerce Input Price Commerce 
Index Composite Index Composite 
Price Index Price Index 


(1947-1949 = 100) (1947-1949 = 100) 


1945 66.8 
1950 109.9 
1951 119.6 
1952 120.7 
1953 124.1 
1954 126.1 


the Turner Construction Company Index. Thus, the Department of Commerce 
procedures involve primarily, though not exclusively, input price indexes.$ 

The results obtained by the Department of Commerce procedures appear to 
be substantially the same as those obtainable by a straightforward index of 
building materials prices and hourly wage rates. If we compare (a) the implicit 
deflators involved in the Department of Commerce procedures for total private 
and public new construction (the “Department of Commerce Composite 
Index”) with (b) the Bureau of Labor Statistics indexes of union hourly wage 
rates for all workers and of building materials prices, the Department of Com- 
merce Composite Index can be seen to lie, with rare exceptions, between the 
two BLS indexes. Moreover, weighting the BLS indexes by the relative im- 
portance of wages and materials in 1947 construction costs,’ I obtain an average 
input price index which in Table 537 is compared with the Department of 
Commerce Composite Index. For all practical purposes, the two indexes are 
identical. Thus, the Department of Commerce Composite Index behaves, in 
essence, as an input price index. Those constituent price indexes which are 
not strictly input price indexes either yield the same results as input price 
indexes or are applied to sub-aggregates of construction which are relatively 
unimportant. The use of the Composite Index for deflation purposes implies 
that there has been no change in the productivity of construction inputs be- 
tween World War I and now—an implication which is not necessarily wrong 
but which certainly warrants further study. 

Of the two instances of international deflation examined, one uses procedures 
which do not rely on comparisons of input prices. In binary comparisons of the 
volume of construction in the United States, the United Kingdom, France, 


5 The National Income Division of the Department of Commerce for purposes of national product calculati 
makes a “rough adjustment” of the data provided by the Building Materials and Construction Division for time 
changes in profit margins. This adjustment, however, seems to be relatively insignificant except for the war years. 
See: Construction Volume and Costs 1915-1964, 1 and 36; and National Income 1954 Edition, 216-7. 

* See Construction Volume and Costs 1915-1964, 27, 28, 32-4. 

7 See Bureau of Labor Statistics, Construction in the 1947 Interindustry Relations Study, BLS Report No. 2, 
February, 1953, 51 and 57. The weights, obtained from the distribution of total inputs for new construction in 
1947 at purchaser’s value, are .64 for the cost of materials (including parts, containers, and supplies) and .36 for 
wages and salaries. These inputs combined account for 83 per cent of the 1947 value of new construction. 


Year 
a 1915 26.3 | 28.4 66.7 
1920 66.0 64.0 106.5 
aa 1925 52.7 50.8 115.4 
aoa 1930 51.9 50.1 | 119.1 
1935 | 47.1 46.6 121.8 
ie 1940 | 54.6 51.8 | | 121.6 
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Germany, and Italy, Milton Gilbert and Irving Kravis use two kinds of price 
ratios.* First, they price in each of the countries a factory building of a type 
common to all countries and five different road surfaces at least one of which 
is used in each pair of countries. Secondly, they price in each of the countries 
eleven different units of building work—e.g., a cubic meter of bulk excavation 
by machine. In both cases, apparently, the prices assigned to the structures or 
the units of building work reflect the methods of construction and specifica- 
tions in the country concerned. Different combinations of the individual price 
ratios are used to deflate different sub-aggregates of construction. For present 
purposes, it is unnecessary to discuss in any detail the weighting procedures 
used except to note the apparent existence of serious difficulties in weighting the 
price ratios for the units of building work. Thus, to obtain a price index number 
for residential building operations, the price ratios for the units of building 
work are weighted by the relative importance of each kind of building work, 
as determined by “a United States government agency,” in the construction of 
“a house of brick or stone similar to the typical French house.”*® 

The other instance of international deflation is a comparative study of Soviet 
and American construction costs in which the present writer participated.’® In 
this case, ruble-dollar price ratios are obtained for twenty-three buildings or 
other structures of Soviet design and specifications and with Soviet input utiliza- 
tions. If these price ratios are representative of Soviet construction and if each 
price ratio were weighted by the relative importance of the building or structure 
in Soviet construction, the procedure would be equivalent to obtaining an 
average of the ruble-dollar price ratios for inputs, weighted by the relative im- 
portance of that input in Soviet construction.'' An examination of the results 
of the procedure described and of other data led to the conclusion that ruble- 
dollar ratios for similar buildings but of American design and specifications and 
with American input utilizations would be quite different. The duplication of 
the procedure described for buildings with American design, specifications and 
input utilizations would be equivalent to obtaining an average of the ruble- 
dollar price ratios for inputs, weighted by the relative importance of that input 
in American construction. Thus, the procedure in the USSR—US comparison 
is equivalent to that employed by the US Department of Commerce in those 
cases where intertemporal price indexes are derived for specific types of build- 
ings, except that the Department of Commerce apparently feels no need for 
alternatively designed buildings (or alternative sets of weights for inputs). 

This survey indicates that, with two major exceptions,’? the measurement of 


* Milton Gilbert and Irving B. Kravis, An International Comparison of National Products and the Purchasing 
Power of Currencies, Paris: Organization for European Economic Cooperation (no date), 192-5. 

* Gilbert and Kravis, op. cit., 194. 

10 See Norman M. Kaplan, A Comparison cf Construction Costs in the USSR and the U.S., The RAND Cor- 
poration RM-1986, September 23, 1957. This study was carried out by the Stanford Research Institute under the 
general supervision of The RAND Corporation. 

1 See equation (9) below. 

12 There are other exceptions which are minor in the sense that they refer to only a part of total construction— 
e.g., the Bureau of Public Roads Highway Index and the Turner Construction Company Index used by the De- 
partment of Commerce. Also, Raymond Powell has called my attention to studies by Lowell J. Chawner, Miles L. 
Colean and Robinson Newcomb, and David M. Blank, which price common outputs or units of building work. 
For the citations to these studies and for Powell's comments on the results of these studies and the issues they con- 
cern, see his A Materials-Input Index of Soviet Construction, 1927/28 to 1955, Part I, 45-54. 
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the volume of construction has been handled by a measure of the volume of 
inputs used in construction or by a deflation procedure using input- rather than 
output-price indexes. The two exceptions are provided by the Gilbert and 
Kravis study in their pricing, first, of common outputs and, second, of units of 
building work. Since the essence of the problems in the deflation of construction 
is the existence of very few common outputs, this notion would seem to have no 
general applicability. The employment of units of building work, however, does 
seem to me to be a very promising notion and its acceptance is the basis of 
subsequent suggetions. 

It should also be noted that in all the preceding instances cf the measure- 
ment of the volume of construction, there is evidence of dissatisfaction with the 
the results. Where input prices or quantities are used, the dissatisfaction arises 
from the difficulties in measuring productivity in construction and the conse- 
quent neglect of an explicit allowance for productivity change or differences." 


3. THE “BASIC” APPROACHES TO THE DEFLATION OF CONSTRUCTION: 
AN ALGEBRAIC STATEMENT AND SOME COMMENTS 


The foregoing account of procedures in use in the deflation of construction 
reveals a diversity of approaches. Before discussing them, it is necessary to 
make some definitional, terminological and notational remarks. First, let us 
distinguish construction projects, components, and inputs. A construction 
project is the final product of construction activity—an apartment building, for 
example, or a railroad line. Construction inputs are goods and services used in 
construction activities—building materials, labor, machine-hours. Construc- 
tion components are intermediate products of construction activity—cubic 
yards of concrete foundation, cubic yards of brick wall, ete. 

Let us assume that a project can be completely analyzed in terms of compo- 
nents and that each component can be completely analyzed in terms of inputs. 
Thus, profits or losses in construction and those overhead costs which cannot 
themselves be described in terms of components or inputs will be ignored in 
what follows. Though this is far from satisfactory, it should be noted that 
special consideration of such overhead costs and profits or losses is inevitable 
in any deflation procedure which does not rely on the pricing of homogeneous 
final products. 

Purely as a matter of terminology, I will use “specifications” in reference to a 
project to mean the quantities of components involved in that project. I will 
use “input utilizations” in reference to a component to mean the quantities of 
inputs used per unit of the component. Both the “specifications” of a project 
and the “input utilizations” for a component are expressed in physical units. 

It is, of course, possible to express an aggregative price index number as an 
appropriately weighted average of price relatives. Thus, if P, and P, represent 
base year and given year prices respectively and Q, and Q, represent base year 
and given year quantities respectively, 


3 See: Central Statistical Office, National Income Statistics Sources and Methods, 300; United States Depart- 
ment of Commerce, National Income 1954 Edition, 156; Powell, A Materials-Input Index of Soviet Construction, 
1927 /28 to 1956, Part I, 33-54, 67-72; Gilbert and Kravis, op. cit., 192-3; and Kaplan, op. cit., 58-74. 
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At various places below, I will take advantage of equalities of this sort by 
omitting the algebraic step which permits me to write the aggregative form of 
the price index number and to speak about the result as a weighted average of 
price relatives. 

For purposes of a slight simplification of an already cumbersome exposition, 
I will make my points in terms of USSR-US construction cost comparisons 
rather than in terms of intertemporal deflation involving base and given years. 
Each of these points, however, can be translated into a proposition which is 
relevant for intertemporal deflation by the substitution of “base year” for 
“USSR” and “given year” for “US”. Thus, I will use the following notation. 
(Some of the symbols will not be used until section 4, but all are defined here 
so as to have them in one place.) Let: 


U;=ruble price of project « of Soviet specifications and input utilizations; 
V,=dollar price of project 7 of American specifications and input utiliza- 
tions; 
U,/=ruble price of project « of American specifications and input utiliza- 
tions; 
V,/ =dollar price of project 7 of Soviet specifications and input utilizations; 
U,/’=ruble price of project « of American specifications and Soviet input 
utilizations; 
V,’’ =dollar price of project i of Soviet specifications and American input 
utilizations; 
W,=ruble price per unit of component j with Soviet input utilizations; 
X,;=dollar price per unit of component j with American input utilizations; 
W,' =ruble price per unit of component 7 with American input utilizations; 
X,'=dollar price per unit of component j with Soviet input utilizations; 
Y, =ruble price of input k; 
Z, =dollar price of input k; 
Aj;=Soviet quantity of component j in project 7; 
B;,=American quantity of component j in project 7; 
C,.;=Soviet quantity of input k in component j; 
D,;= American quantity of input k in component 7; 
E,;=Soviet quantity of project 7; 
F;=American quantity of project 7; 
G;=Soviet quantity of component 7; 
H;=American quantity of component j. 


The assumption that a component can be completely analyzed in terms of in- 
puts implies the following relationship 


W;= Dd (1) 
k 


and analogous relationships between X; and Z,, between W,’ and Y;, and be- 
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tween X,;’ and Z,. The assumption that a project can be completely analyzed 
in terms of components implies the following relationship 


U;= (2) 
j 


and analogous relationships between V; and X;, between U,’ and W,’, between 
V,/ and X,’, between U;”’ and W;, and between V,’’ and X;. Finally, the quan- 
tities of components and projects are related in the following ways: 


G; = (3) 
H; = > B;iF (4) 


Ordinarily, in the formation of price index numbers for deflation purposes we 
use price relatives of the form 


F 


U; j 
V; DXB, > 
j k 


(5) 


where 
Aji Byi. (6) 


That is to say, we seek commodities whose specifications are as nearly alike as 
possible in the USSR and the US and we use the selling prices of such com- 
modities in each economy. The ruble and dollar selling prices, of course, are 
based on, respectively, Soviet and American methods of production, i.e., input 
utilizations. I begin with the presumption that the heterogeneity of the final 
output of the construction industries in the USSR and the US makes inappli- 
cable the ordinary procedure described by equations (5) and (6) and calls for 
some other approach. 

It should be emphasized that the trouble with the ordinary procedure is not 
the usual index number problem of adding horses and apples. It is rather the 
problem of what to do if neither the horses nor the apples are identical in the 
USSR and the US. From this point of view, one additional remark is necessary 
with respect to the notation introduced above. There, I speak of project 7 in 
terms of either Soviet or American specifications (i.e., quantities of com- 
ponents) which is unambiguous. In addition, however, it is necessary to say 
that one project is distinguished from another in terms of type of service pro- 
vided (housing space vs. electric power) and type of construction (wood vs. 
reinforced concrete). Thus, “project 7” should be rephrased as “project of 
type 7”. In other words, horses are distinguished from apples in terms of the 
type of services each provides and the type of materials of which each is pri- 
marily composed, and the Soviet iiorse differs from the American horse in size, 
weight, and other specifications. 

If it is impossible to think of a comparison of the volume of construction in 
the USSR and the US as a comparison of weighted aggregates of internation- 
ally homogeneous projects, we must re-examine the question of what it is we 
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are, or should be, trying to measure in comparisons of construction output. The 
foregoing account of procedures in use (section 2) revealed an appreciation of 
the inapplicability to construction of ordinary types of price index numbers and 
the development of various alternatives to the pricing of final output. At least 
three “basic” approaches to the deflation of construction are in use: (a) the 
pricing of what I will call “standardized projects”; (b) the pricing of compo- 
ments; and (c) the pricing of inputs. Each of these approaches implies a 
corresponding concept of construction output—i.e., as an aggregate of stand- 
ardized projects, of components, or of inputs. In the present section I will 
explore each of these “basic” approaches. Because each is inadequate in some 
important respect, I am led to the formulation of two additional approaches in 
section 4—a variant of the standardized projects approach and an inputs ap- 
proach with a productivity adjustment. 

Perhaps the most frequently used deflation procedure in the case of construc- 
tion is to price inputs rather than outputs and to use an appropriately weighted 
average of such price relatives as the deflator. In our notation, the inputs ap- 
proach consits in the use of a price index number of the following kind 


i 7 k 


This is a Soviet-weighted input price index number and there is, of course, an 
alternative US-weighted input price index number. To economize space, I will 
consider in the present section only Soviet (or base year) weighted index num- 
bers and I will ignore the existence in each case of an alternative, equally ap- 
propriate, US (or given year) weighted index number. 

It is not difficult to appreciate the attractiveness of the inputs approach in 
comparison with other possiblities. The inputs approach relies on a relatively 
small number of inputs (a) which are common to both economies (or both time- 
periods), (b) for which data on relative importance in construction as a whole 
are more or less readily available,“ and (c) most of which are relatively easy to 
identify, specify, and price. Unfortunately, however, there is a major dis- 
advantage in relying on an inputs approach—namely, that inputs are, after 
all, not outputs. The point is usually put in terms of differences in labor pro- 
ductivity: a man-day of Soviet and American brick-laying labor, though 
superficially the same input, are commensurable only in terms of relative pro- 
ductivity. Thus, a comparison of the volume of construction inputs in the USSR 
and the US may yield results which are quite different from the relative vol- 
ume of, in some sense, outputs. 

A second approach is the pricing of standardized projects. That is to say, the 
relevant price relatives refer to the ruble and dollar costs of a project either of 


4 It is, of course, not necessary for the inputs approach to distinguish between, say, Cz;, Aji, and Ej as index 
number (7) does. The weights needed for the inputs approach are the relative importance of each input, or the 
quantities of each, in construction as a whole (or in some sector of construction, if the sectors are to be deflated 
separately). The distinctions employed in (7) are for purposes of comparability with other price index numbers 
to be developed below. 
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Soviet specifications and input utilizations or of American specifications and 
input utilizations, depending upon whether a Soviet- or US-weighted price 
index number is sought. Thus, in our notation, the standardized projects ap- 
proach involves price relatives of the following kind 


j 


U; 
V;’ Xj Aji 


and alternative, analogously defined, price relatives U,’/V;. Weighting the 
price relatives (8) by the relative importance of each project in Soviet con- 
struction, one obtains a price index number which is identical with that ob- 
tained in the inputs approach. Thus, 


LVL 
i i J i J k 

i i j t j k 


It follows that the standardized projects approach shares the major dis- 
advantage of the inputs approach—i.e., neglect of productivity differences. 
Also, in comparison with the inputs approach, the standardized projects ap- 
proach has a related disadvantage and advantage. From the viewpoint of a 
price index for construction as a whole, the standardized projects approach 
requires more information—namely, information on projects’ designs and rela- 
tive importance; correspondingly, the standardized projects approach makes 
possible price index numbers for sub-aggregates of construction. 

The third approach, the components approach, involves price ratios of the 
following kind 


W; 


For construction as a whole, the Soviet-weighted price index number is 
> WG; 


The rationale for the components approach may be stated in extreme form 
as follows. Though projects are heterogeneous as between the USSR and the 
US, components are homogeneous. Though a typical Soviet thermal power 
station is markedly different from a typical American thermal power station, 
the underlying components—the concrete foundation, the eaith moved, ete.— 
are the same in the two economies. “The same” means not the same quantitites 
of inputs used but the same kinds of intermediate outputs. That is to say, the 
projects are heterogeneous only because they represent different combinations 
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of components which are common to both economies. Let us, then, think of 
construction in the USSR and the US not as aggregates of projects nor as 
aggregates of inputs but as aggregates of components each of which is the same 
in the two economies. For deflation purposes, let us price each component in 
rubles with Soviet input utilizations and in dollars with American input 
utilizations, i.e., as in (10). 

Whatever the appeal of this rationale,“ the components approach involves a 
number of additional empirical questions. For one thing, I have not really 
specified the meaning of a component beyond (a) the general indication that 
it is an intermediate product between inputs and the final building or structure 
and (b) the provision of a few examples. If, in drawing up a list of components 
to be priced, one were to define a component too broadly in terms of technical 
characteristics, important problems of heterogeneity might arise. On the other 
hand, if one were to define a component very precisely in terms of technical 
characteristics, it is possible that the list of components to be priced would 
grow out of hand. 

Related to this problem are at least two others. There may be national quali- 
tative differences in construction, workmanship, materials standards, or dura- 
bility which make apparently identical components really different. Also, for a 
given economy, the price of a component will not be, in all cases, independent 
of the project to which it contributes; for example, the price per cubic yard of 
concrete foundation for a large power project will undoubtedly differ from that 
for a small house because in the former case the scale of operations permits, 
say, more complex materials handling devices. This, however, is troublesome 
only if the price of a component varies more or less continuously with the quan- 
tity of the component used. If there are simply discontinuities depending on 
scale, different scales can be defined as different components. The question, 
then, becomes: Is the number of components sufficiently small to be manage- 
able? 

If reorganizations of existing data collection systems are ruled out, one must 
ask: Are data available on costs and quantities of components? In the case of 
the USSR, abundant information on the costs of components is published as 
part of the Soviet “project-making” and cost-estimating procedures, though 
important data problems arise in the interpretation and use of such informa- 
tion.'® If my understanding of American cost-estimating procedures is correct, 
construction organizations here, too, rely extensively on the cost of compo- 
nents; such information, of a quality unknown to me, appears to be generally 
available in trade publications like the Engineering News-Record. Gilbert and 
Kravis also indicate that such cost information is available, though not without 
problems, for the economies they studied. 

The real information problem is the unavailability of data on the quantities 
or relative importance of components in construction as a whole. In neither 
the USSR nor the US are such data ordinarily or regularly published. Without 


4 Gilbert and Kravis defend the components approach in substantially the same way. See Gilbert and Kravis, 
op. cit., 194. 
%* See Kaplan, op. cit., 4-13, 24-32. 
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a sample survey or census of construction organizations, I do not see how US 
government agencies could collect and publish such data. The kinds of weights 
employed by Gilbert and Kravis for sub-aggregates of construction" is further 
testimony to the poverty of such information. Without data on the quantities 
or relative importance of components, the price relatives for individual com- 
ponents cannot be combined with reasonable accuracy into price index num- 
bers. 

To summarize, I assume that a list of components can be composed in which 
the components are homogeneous as between the USSR and US, or at least are 
relatively homogeneous in comparison with projects or inputs, and are suffi- 
ciently homogeneous to make their price relatives more reliable than those for 
projects or inputs. On this basis, I find the components approach preferable in 
principle to the other approaches heretofore discussed. In practice, however, 
the possible need to price a very large number of components is an important 
reservation and the present unavailability of quantity or relative-importance 
data as weights make appropriate weighting impossible. 


4. TWO ALTERNATIVE APPROACHES TO THE DEFLATION OF CONSTRUCTION 


Given the desirability of the components approach, the problem, then, is: 
Are there alternative approaches, equally acceptable in principle and more 
feasible in practice? In discussing this problem and the two alternative ap- 
proaches I have in mind, I will continue with the notation previously intro- 
duced, a notation which assumes USSR-US construction cost comparisons. 
Also, I will now consider expucitly both Soviet- and American-weighted index 
numbers. Finally, at the risk of oppressing nearly everyone—those who prefer 
algebra, those who prefer numerical examples, and those who hate both—I 
propose, for purposes of clarification, to combine an algebraic treatment of 
each of the approaches with an extended numerical example. The numerical 
example assumes that: (a) Soviet and American construction consists of dif- 
ferent quantities of two projects—a “house” and a “factory”; (b) both the house 
and the factory consist of two components, “carpentry” and “roofing,” whose 
quantities differ as between the house and factory in each economy and, for 
the house or the factory, as between the USSR and the US; and (c) each com- 
ponent involves two inputs, “labor” and “wood,” whose quantities again differ 
as between carpentry and roofing in each economy and, for carpentry or roofing, 
as between the USSR and the US. Under these assumptions, Table 546 pre- 
sents the relevant quantity data and Table 547 the relevant price data for 
inputs, components, and projects. Table 548 presents the alternatively- 
weighted price index numbers yielded by the inputs approach, the stand- 
ardized projects approach and the components approach, as well as the in- 
dividual ruble-dollar price ratios which underlie each approach. In Tables 546- 
548, the symbols and equations in parentheses represent the counterparts, as 
applied to the numerical example, of the notation and relationships previously 
introduced. 

Let us turn now to the alternative approaches. As indicated, I have two in 


1 Gilbert and Kravis, op. cit., 194. 
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TABLE 546. QUANTITY DATA 


USSR US 
1. Quantities of inputs per unit of 
component 
a. Quantity of labor per unit of 
carpentry 3 (Cu) 1 (Du) 
b. Quantity of wood per unit of 
carpentry 8 (Cu) 5 (Du) 
ce. Quantity of labor per unit of 
roofing 4 (C12) 2 (Diz) 
d. Quantity of wood per unit of 
roofing 7 (C2) 6 (Des) 
2. Quantities of components per unit 
of project 
a. Units of carpentry per house 6 (An) 8 (Bu) 
b. Units of roofing per house 9 (An) 2 (Bau) 
c. Units of carpentry per factory 3 (Ai) 10 (Biz) 
d. Units of roofing per factory 7 (Ag) 5 (Bx) 
3. Quantities of projects 
a. Number of houses 2 (Ff) 25 (F:) 
b. Number of factories 5 (EB) 30 (F:2) 
4. Quantities of components 
a. Total units of carpentry 27 | 500 (Hi =Buki + Bik) 
b. Total units of roofing 53 | 200 (H2=BuFi+ BnF:) 


mind. Both assume the desirability of the components approach. That is to 
say, for present purposes, the “true” price index numbers alternatively weighted 
are taken to be: 


We; 
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X iG; 
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and 
W,H; 
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> 


(12) 


In other words, construction is interpreted as an aggregate of components and 
the “true” output comparisons alternatively weighted are taken to be: 


> WG; 
i 
> WH; 
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TABLE 547. PRICE DATA 


Ruble Price 


Dollar Price 


1. Prices per units of inputs 
a Labor 
b. Wood 

2. Prices per units of com- 
ponents 


. Carpentry withSovict 
input utilizations | 
. Roofing with Soviet | 
input utilizations 
. Carpentry with Amer-; 
ican input utilizations 
. Roofing with Ameri- 
can input utilizations 
. Prices per project 
a. House of Soviet speci- 
fications and input 
utilizations 
. Factory of Soviet 
specifications and in- 
put utilizations 
. House of American 
specifications and in- 
put utilizations 
. Factory of American 


specifications and in- | 


put utilizations 

. House of American 
specifications and 
Soviet input utiliza- 
tions 

. Factory of American 
specifications and 
Soviet input utiliza- 
tions 

. House of Soviet speci- 
fications and Ameri- 
can input utilizations 

. Factory of Soviet 
specifications and 
American input uti- 


lizations 


10 (¥1) 
40 (Y2) 


350 (WwW, = YiCuy + Y¥2C2) 
320 (We = Y2C 22) 
210 = YiDut+Y2Da) 
260 (W2' = Y2D22) 
4980 (U, = WiAnt+ W2An) 
3290 (U2 = Wi Ai. + 


2200 (U,’ = + Bx) 


3400 (U,’ = Wi’But 


3440 (U,"’ = W,Bu+W:Bau) 


5100 (Us"’ = Wi Biz + 


30 
5 


130 


155 (X,’ =Z,Ci2+Z2C 22) 


55 =Z,Dit+Z2Da) 


90 


2175 = 


1475 


620 (Vi = Xi1Bn+ X2Ba) 


1000 (V2 = Xi X2Be) 


1140 = NX,Au+X2An) 


795 ( V,’ ‘= XiA 12 + X2A 22) 


The alternative approaches to be suggested here are suitable modifications 
of the standardized projects and inputs approaches—modifications which 
yield the “true” price index numbers aud output comparisons achieved by the 
components approach. The first of the alternatives is a variant of the stand- 
ardized projects approach. Instead of obtaining price relatives which compare 
the ruble and dollar costs of a project either with Soviet specifications and 
input utilizations or with American specifications and input utilizations, let us 
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TABLE 548. THE RESULTS OF THE “BASIC” APPROACHES 


Ruble Dollar Ratios (in Rubles per dollar) 


A. Individual Ruble-Dollar aa 


Ratios: 
1. Inputs Approach y, 
a. Labor .33 
Zi 
b. Wood 8. [+] 
2 
2. Standardized Projects 
Approach 
a. House 
i. Soviet Specifica- U; 
tions and Inputs |2.29 


ii. American Specifi- 
cations and Inputs 
b. Factory 
i. Soviet Specifica- 
tions and Inputs 


o 
o 


ii. American Specifi- 
cations and Inputs/3 .40 


3. Components Approach 
a. Carpentry 6.36 


b. Roofing 3.56 


B. Weighted Average Ruble- 
Dollar Ratios: 
2.25 ¥i(CuGs) + + + Y2(C22G2) 
Zi(CurGi) +Z1(Ci2G2) + +Z2( 
45 [ ¥i(Du Ai) + Yi(Di2H2) + Y2(DaHi) + 
Z:(Du +Z1(Di2H2) +Z2( Dai) +2Z2(DnH2) 


bo 

w 

| 

val 


b. American Weights {3. 


2. Standardized Projects 
Approach 


a. Soviet Weights 2.25 | 


U,'Fi + U2'F2 
b. American Weights 45[ 
ViFi+V2F2 


3. Components Approach 
a. Soviet Weights 


+ X2G2 


b. American Weights [5. 25 


+X2H2 


use native input utilizations in both cases. That is to say obtain price relatives 
of the following kinds: 
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= 16 
Vi X;Bi (16) 
j 


If the price relatives (15) are weighted by the relative importance of the 
projects in Soviet construction and the price relatives (16) are weighted by the 
relative importance of the projects in American construction, the results of 
deflation by such price index numbers are the “true” output comparisons (13) 
and (14). 


That is: 


i j j 
LVF: DUE LVF 
a i i j 


and 


VF: > UE; Wink: > 
i i a j j 


= = = - (18 


i i 


In terms of our numerical example, the proposed variant of the standardized 
projects approach yields average ruble-dollar price ratios of 4.22 rubles per 
dollar with Soviet weights and 5.25 rubles per dollar with American weights 
(Table 550). These are the “true” price index numbers as given by the com- 
ponents approach, whereas the results of both the inputs and standardized 
projects approaches are 2.25 rubles per dollar with Soviet weights and 3.45 
rubles per dollar with American weights (Table 548). 

The second of the alternative approaches suggested seeks to adjust input price 
indexes such as (7) for productivity differences between the USSR and the US. 
The usual notion in this connection is to adjust the hourly-wage relatives for 
differences in labor productivity (output per man-hour), but, for present pur- 
poses, such adjustment alone is incorrect. An American man-hour of brick- 
laying labor may be more productive—in different senses—because it has more 
capital to work with or because it is qualitatively better. If capital inputs are 
included in the input price relatives, the alternative Soviet and American 
weights express, though inadequately, some aspects of the greater productivity 
of an American man-hour by expressing the national differences in the relative 
importance of labor and capital. Similarly, one manifestation of the greater 
productivity of an American man-hour might be a savings in materials, which 
is also reflected in the differences between Soviet and American weights. 
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TABLE 550. A VARIANT OF THE STANDARDIZED PROJECTS APPROACH 


Ruble-Dollar Ratios (in Rubles per Dollar) 


A. Individual Ruble-Dollar Ratios ~ 
1. House 
a. Soviet Specifications* 


b. American Specifications* 


2. Factory 
a. Soviet Specifications* 


b. American Specifications* 


1 
4.37[ 
Uy" 
5.55 [ ~| 
5. 10[ | 


B. Weighted Ruble-Dollar Ratios +UsEs 
1. Soviet eights Vi"Ei:t+V2"Es 


Uv Fi + Ps 
5.25 ViFit+V2F2 ] 


2. American Weights 


* Soviet input utilisations for numerator, American input utilizati for d inator. 


Thus, some aspects of the greater productivity of an American man-hour 
are already expressed in the differences between a Soviet- and an American- 
weighted input price index, but some aspects are not so expressed or are not 
fully expressed. What is needed is a measure of the difference in total input 
productivity—i.e., in the total inputs required per unit of output. Here the 
difficulty is that which originated the entire discussion, the difficulty of de- 
fining and measuring the comparable unit of output. If the desirability of the 
components approach is accepted, then the proper definition of a unit of output 
for purposes of productivity comparisons is a unit of a component. Accordingly, 
let us define input productivity ratios, measured alternatively in ruble and 
dollar prices, for a given component as follows: 


- 


Xi 
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(20) 


It is, perhaps, easier to perceive the meaning of the input productivity ratios 
in the numerical example. As Table 551 indicates, the input productivity ratio 
for carpentry measured in Soviet prices is .600 and measured in American 
prices is .423. That is to say, for a unit of carpentry, American input require- 
ments are 60 per cent of Soviet input requirements, both measured in rubles, 
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TABLE 551. AN INPUTS APPROACH WITH A PRODUCTIVITY ADJUSTMENT 


A. Individual Input Productivity Ratios: 
1. Carpentry ¥iDu + YaDu 

a. Measured in Soviet Prices = 

ZiDu + | 
ZiCu + Z2:Cn 

a. Measured in Soviet Prices + + 


Z:Di2 + 


b. Measured in American Prices 


b. Measured in American Prices ; 
. Weighted Average Input Productivity | Cis + ZC 
Ratios: 


1. Measured in Soviet Prices with Wi'Hy + We'Hs 


Ameri Weight [ | 
merican Weights 657 + 
2. Measured in American Prices with Xt + Eee 


Soviet Weights 533 [ | 
. Weighted Average Input Price Ratios Xi'Gi + X2'Gs 
(Table 547) Divided by Weighted Average 
Input Productivity Ratios: 
1. Soviet Weights for Price and Produc- 
tivity Ratios 


1 
2.25 x 33 = 4.22 


2. American Weights for Price and Pro- 
ductivity Ratios 


3.45 X 


and 42 per cent of Soviet input requirements, both measured in dollars. Or, 
the productivity of Sovier inputs in carpentry is 60 per cent of American in- 
puts, measured in rubles! and 42 per cent of American inputs measured in 
dollars. 

Given these definitions of the input productivity ratios, the use of an ap- 
propriately-weighted input price ratio divided by an appropriately-weighted 
input productivity ratio as a deflator results in a “true” output comparison as 
given by the components approach. That is: 
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It should be emphasized that “appropriately-weighted” here means two 
things: (1) for the input productivity ratios, “appropriately-weighted” means 
cross-weighted in the sense that the individual input productivity ratios meas- 
ured in rubles (dollars) are weighted by American (Soviet) quantities of 
components to obtain the “appropriately-weighted” average input productivity 
ratio; but (2) the American (Soviet) weighted average productivity ratio is 
combined with the American (Soviet) weighted price ratio to obtain the “true” 
deflator. Perhaps, again, these relationships are clearer in the numerical ex- 
ample. As Table 551 indicates, the appropriate combinations of an average 
input price ratio and an average input productivity ratio yields deflators of 
4.22 and 5.25 rubles per dollar, which are the “true” price index numbers as 
given by the components approach. 

In principle, the productivity adjustment to the inputs approach requires 
the same kind of information as that which seemed to make the components 
approach itself infeasible. In order to obtain the average productivity ratios, 
the quantities (or relative importance) of components in Soviet and Ameri- 
can construction are needed and, at present, seem to be unavailable. There is, 
however, a side condition which, if it holds even approximately, greatly reduces 
the information requirements of the productivity ratios and thereby creates 
important advantages for the inputs approach with a productivity adjustment. 
Suppose there is a strong central tendency in the input productivity ratios 
among components. In other words, suppose that whatever productivity dif- 
ferences exist between the USSR and the US, they are roughly the same from 
component to component when measured in a single set of prices. If this is so, 
only a limited sample of components need be studied, and they can be studied 
without reference to the quantities of components involved in Soviet or Ameri- 
can construction. Even if a relatively large sample of components is needed to 
determine the central tendency in the productivity ratios, the distribution of 
the resultant ratios may be such as to obviate the need for the precise quantities 
(or relative importance) of the components in Soviet or American construction 
as weights, given qualitative knowledge of the importance of components. 

Of course, it is also possible to hypothesize a similar side-condition which 
obviates the need to weight precisely the price relatives for components and 
which points to advantages for the direct components approach. Yet there 
seems to me to be important differences between the two cases. For one thing, 
I have the feeling that the likelihood of a strong central tendency in the pro- 
ductivity ratios is greater than that in the components’ price relatives, at 
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least for the USSR-US comparison. More important in the absence of concrete 
studies is the fact that the productivity ratios are used as an adjustment to 
already obtained and weighted price ratios. In the direct components approach, 
exclusive reliance is placed on a weighted average of components’ price rela- 
tives, and important errors in the quantity weights might affect the results 
substantially. In the inputs approach with a productivity adjustment, the same 
errors in the quantity weights would affect only the productivity ratios and, 
therefore, would be expected to have less substantial effects on the results. 
Thus, with presently available information I am inclined to favor the adjusted 
inputs approach over the direct components approach. 

Since the adjusted inputs approach in principle requires unobtainable data 
on the quantities of components in construction as a whole, why not rely on 
the previously described variant of the standardized projects approach which 
yields identical results but has no such data requirements? In comparison with 
the adjusted inputs and direct components approaches, this variant of the 
projects approach has both advantages and disadvantages. Its advantage, 
apart from the absence of a requirement for data on the quantities of compo- 
nents in construction as a whole, is that separate deflators for sub-aggregates of 
construction may be easier to obtain. On the other hand, it requires additional 
information on the design of typical projects in both the USSR and the US. 

A further and perhaps more important disadvantage is a point that has not 
been discussed hitherto. We have been led to a consideration of all these com- 
promise approaches by the heterogeneity of construction projects which im- 
pedes the formation of ordinary price relatives such as (5). By the hetero- 
geneity of construction projects in this connection we meant the paucity of 
projects that are identical in the USSR and the US. The variant of the stand- 
ardized projects approach involves (a) the selection of typical projects in the 
USSR and, alternatively, in the US and (b) the employment, for each set of 
projects, of a weighting system which represents the relative importance of 
the typical project in Soviet or American construction. Thus, the variant of 
the standardized projects approach assumes that, while construction projects 
are heterogeneous as between the USSR and the US, they are less so for the 
USSR and the US considered separately in the sense that typical projects can 
be found and weighted as indicated. I am now questioning whether a relatively 
small number of such projects can be found and weighted to represent all of 
Soviet or American construction. 

Of course, if a relatively small number of typical projects cannot be found, 
this difficulty can be overcome in principle by expanding the list of projects to 
be priced until, as a limit, all projects actually underway are present in the 
list. Thus, in the last analysis, the choice between the adjusted inputs approach 
and the variant of the standardized projects approach turns on the amount of 
information required and the costs of acquiring it. 

If the costs of acquiring additional information are to enter the discussion, 
it is legitimate to ask whether any of the “true” approaches discussed—the 
components approach, the inputs approach with a productivity adjustment, 
and the variant of the standardized projects approach—are worth the improve- 
ment in results they promise as compared with the results of the cheaper and 
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more usual inputs approach. Though I am not prepared to answer this ques- 
tion, a few remarks seem in order. 

Our numerical example selected numbers in such a way as to produce, for 
illustrative purposes, large differences in results as between the inputs and com- 
ponents approaches (Table 548). It is certainly conceivable that there are cir- 
cumstances in which the inputs approach yields results which are so close to 
the “true” price index numbers that any more elaborate approach is a luxury. 
For example, it is obvious that if both the average productivity ratios are ap- 
proximately unity, the results of the inputs approach will be approximately 
the same as the “true” price index numbers. Furthermore, if there is no un- 
ambiguous difference in productivity—if one of the average productivity ratios 
is greater, and the other less, than unity—then either the American-weighted 
or the Soviet-weighted input price ratio will be greater, and the other will be 
less, than the similarly weighted components price ratio. If there is an unam- 
biguous productivity difference—if both the productivity ratios are greater or 

th are less than unity—then each of the American-weighted and Soviet- 
weighted iuput price ratios will be greater, or each will be less, than the simi- 
larly weighted components price ratios.'* Among these various combinatioiis of 
circumstances, there will be some which result in alternately-weighted input 
price ratios which bound the alternately-weighted components price ratios 
and which are themselves not very different in magnitude. If this were the case 
in any actual deflation, then again we might be content with the input price 
ratios. But to determine whether or not it is the case, one must perform the 
alternate calculations and comparisons at least with small samples. Thus, at 
the very least, the approaches and concepts discussed here would seem to be 
useful as ways of studying and evaluating particular deflation procedures in 
use. 

Indeed—and this is by way of summary—all of the “true” approaches sug- 
gested here should be interpreted as suggestions for further study since the 
desirability of any of them and the choice among them turn on empirical mat- 
ters. Thus, the desirability of the components approach depends upon the 
ability to reduce construction to a relatively small number of components, 
suitably defined, in which the price per unit is independent of the number of 
units used in a project and of the kind of project constructed. Though the pro- 
ductivity ratios may be independently useful as an approach to the problem of 
defining and measuring productivity differences in construction, the system- 


E.g., if 


> 1 a 


> and < 


Similarly for the other combinations mentioned. 


ont 

atic application of the inputs approach with a productivity adjustment re- gE: 
RES 

then 


DEFLATION OF CONSTRUCTION 555 


quires either (a) more information on the quantities of components than ap- 
pears to be available at present in either the USSR or the US, or (b) a strong 
central tendency in the productivity ratios, measured in a single set of prices, 
among components. The applicability of the variant of the standardized proj- 
ects approach depends upon either (a) the ability to select and weight ap- 
propriately a relatively small number of typical projects, or (b) a willingness 
to bear the costs of expanding the list of projects. Given the desirability of the 
components approach, then, the choice among the “true” approaches (the 
components, the adjusted inputs, and the variant of the projects approaches) 
turns essentially on the amounts of information required by each, which de- 
pends on the empirical matters just enumerated, and on the cost of acquiring 
such information. 

Though so many aspects of the “true” approaches are unsettled, the need for 
further studies along these lines seems unquestionable. Of the observed de- 
flation procedures in actual use, most—and certainly the most important, the 
official British and American procedures—involve the pricing of inputs rather 
than outputs. The deficiences of such an approach are obvious, appreciated, 
and worrisome. The only alternatives to the pricing of inputs appear to be the 
pricing of final or intermediate outputs. Since the heterogeneity of construction 
projects makes impossible the pricing of identical final outputs, all that seems 
to remain is the pricing of intermediate outputs or some alternative approach 
which amounts to the same thing. 
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AN ECONOMETRIC MODEL FOR UNITED 
STATES AGRICULTURE 


A. CROMARTY 
Michigan State University 


Economists and agricultural economists have done parallel research 
in estimating quantitative economic relations for various economic 
sectors. This study is an attempt to bring together the agricultural and 
non-agricultural sectors by grafting onto a master model of the total 
economy a set of estimated relationships for the agricultural sector. In 
this way it is possible to trace the effects of changes in the non-agricul- 
tural economy through the agricultural sector and in turn to make 
some estimates of the contribution of agriculture to the total. In es- 
timating supply, demand and price relationships within agriculture 
twelve product categories are examined. This permits the results to be 
applied as forecasting devices for disaggregated commodity groups 
within agriculture. The results are not as gratifying as one would wish 
and consequently many of the relationships are being re-examined and 
re-estimated. 


INTRODUCTION 


F RESEARCH may be likened to the forging of links in a chain, then the study 
I reported in this article represents another step toward the completion of a 
chain of results useful to people concerned with agriculture. This study is a 
continuation of a more comprehensive project begun by Lawrence Klein and 
Arthur Goldberger who have published their initial results [1]. The Klein- 
Goldberger “master” model left open a more complete development of the re- 
lationships within agriculture and in this sense may be regarded as a sponsor for 
the present study. 

The study being reported in this article is also of a continuing nature and 
the results presented are considered as preliminary. Plans have been completed 
to extend the project by concentrating in four areas. The first involves a refor- 
mulation of those equations whose parameter estimates appear to be inconsist- 
ent with economic reality. Forecasts of variables since 1953 along with a com- 
parison of subsequently observed real values is a guide in such reformulation. 
The second major step is to incorporate into the data the 1954 census revisions 
and to add the years 1954 through 1957 to the original series. A third improve- 
ment is to include in the modified project the effect of releasing Commodity 
Credit Corporation stocks of commodities onto the domestic market. The final 
planned area of work is to incorporate into the crop supply equations additional 
indexes of climatic factors influencing crop production. Work in the latter two 
phases is currently under way [2]. A more complete report will be published 
upon the completion of the four steps outlined above. 


1, PROCEDURE 


Quantitative estimates of economic structure in American agriculture are 
useful to farm producers, policy makers, and handlers of agricultural products. 
As new data and techniques in computation and theory construction become 
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available, it is possible to describe and measure in increasing detail the generat- 
ing processes by which agricultural products are produced and distributed. 
The model presented below is constructed, and the parameters estimated, in an 
attempt to (1) provide meaningful estimates of some of the macrorelationships 
existing in agriculture, and (2) measure some of the effects of the nonagricul- 
tural sector of the economy on agriculture. 

The procedure followed involves disaggregating agriculture into a series of 
product categories which have somewhat homogeneous demand and supply 
structures. For each product category a single supply relationship is specified 
and estimated. On the demand side, several relationships may be specified and 
estimated depending upon the existence of government loan and purchase 
programs, or inventory stocks, in addition to commercial demand. In the esti- 
mating procedure, an allowance is made for the interaction of product cate- 
gories by simultaneously estimating the parameters in the demand and supply 
relationships of related product categories. In addition, nonagricultural influ- 
ences are introduced by selecting relevant variables from the more complete 
“master” model. 


2. CONSTRUCTION OF PRODUCT CATEGORIES 


2.1 Agriculture as a single economic sector is disaggregated into commodity 
groups for which group models are established. The procedure followed for dis- 
aggregating is based on three criteria. The first consideration is for results 
which would be useful for those persons predicting demand and supply relation- 


ships for major agricultural commodities. The second criterion involves a limit 
set by the number of structural equations which may be estimated on the basis 
of the data available. The third criterion requires that there be sufficient homo- 
geneity in the products entering each product category to permit meaningful 
supply and demand relationships to be derived. The following twelve product 
categories resulted from applying these criteria: 


1. Wheat . Poultry meats (turkeys, broilers’ 
2. Feed grains (corn, oats, barley, sorghum chickens) 
grain) . Soybeans 
3. Beef cattle and calves . Cotton 
4. Dairy products . Tobacco (flue cured and burley) 
5. Hogs . Truck crops 
6. Eggs . Miscellaneous 


The first eleven categories accounted for approximately eighty-five per cent of 
the value of cash receipts from farm marketings and government payments in 
195i. 

2.2 Annual observations on the above product categories are included for 
the 1929-1953 time period. For livestock and livestock products calendar years 
are used while for the crop categories “crop” years are considered. In those 
cases where feed grains are an input in livestock production, a three-month lag 
in the feed category is used. Because of such interactions, categories two 
through seven are fitted simultaneously as a subsystem while each of the re- 
maining categories is considered as a separate subsystem. Estimating tech- 
niques vary between product categories, but in general limited information 
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maximum likelihood methods are applied to all relationships except the supply 
relationships of crops for which least squares applications are made. All equa- 
tions in the model are specified to be linear in the original variables. 

2.3 There is a similarity between the behavioral relationships of each prod- 
uct category. In general they may be described as follows. Farm production 
involves the transformation of inputs into outputs to be marketed. Output may 
be sold commercially into non-farm channels, stored on the farm and consumed 
as an input at a later stage, or placed under government loan or purchase agree- 
ment. Since sales are seldom made directly to consumers, the price received by 
the producer is influenced by the actions of intervening wholesaling, processing, 
transporting, and retailing agencies. The government may enter the market as 
a purchaser or supplier of goods while private exports and imports also occur. 
A general identity showing the sources of distribution during a given time period 
is presented. 


Yat+ Za = Yat Yat Yoo t Yee = (1,2,---,11) (2.3.1) 


Y « = production in physical units of the ith product category. 

Y 3 =net exports. 

Y «=government demand, or amount going under loan and purchase agree- 
ment. 

Y s =commercial demand, including quantities consumed in the farm house- 
hold. 

Y ¢=inventory demand, or the carryover in private farm storage of the cur- 
rent and previous years’ crops. 

Z = beginning inventory or the carry in of previous years’ crops. 


The i subscript indicates the eleven product categories given in 2.1. Y sym- 
bols designate variables largely endogenous to the agricultural sector, while Z 
symbls represent predetermined variables. The first subscript refers to the 
product category while the second refers to the economic variable. In addition 
to the five endogenous variables presented in (2.3.1) estimates of the current 
price received by farmers for each product category are developed. 


3. SUPPLY AND DEMAND RELATIONSHIPS 


3.1 All supply relationships represent interaction of behavioral and tech- 
nological functions. This specification is necessary since data are not available 
which show the allocation of each input factor to the respective product cate- 
gories. Thus some of the variables in conventional production functions are 
unobservable. For the crop categories, supply functions are fitted in two stages. 
Production functions are specified which associate production with acreage 
planted end those factors affecting the yield on this acreage. This may be writ- 
ten as in (3.1.1). 


(Ya, Yir/Zsj, uss) i = (1, 2, 8, 9, 10, 11) (3.1.1) 


Y « = production of i-th crop category. 
Y;;=acreage planted. - 


1 Subscripts indicating current variables are omitted except where lagged variables are used i.e., [ig ]e. 
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Z;;=those input factors which affect the yield of crops and include fertilizer 
applied, machinery used, weather influences, and a linear time trend to 
account for varietal improvements, weed control, etc. 

u;; =random disturbance. 


This relationship may be read as “the production of crop 7 is associated with the 
acreage planted and those factors affecting the yield on this acreage.” The 
diagonal bar is introduced to separate the endogenous variables appearing on 
the left from the predetermined variables appearing on the right. Since more 
than one endogenous variable appears in (3.1.1), it cannot be fitted by a least 
squares procedure if unbiased parameter estimates are to be obtained. An addi- 
tional equation involving Y ;7 is introduced as (3.1.2). 


(Vir/Zin, Wie) (3.1.2) 


Z% = predetermined variables associated with acreage planted. 
uj, =random disturbance. 


This relation can be fitted by least squares. However, the procedure followed 
is to substitute the Z% and ux of (3.1.2) for Yi7 in (3.1.1) and fit the resulting 
reduced form by least squares. 


(Ya/Zis, Zin, Wij, Vin) (331.3) 


The random disturbances may be combined to form a new random term. 

3.2 For the supply relationships of the livestock product categories, limited 
information maximum likelihood fits are obtained. This procedure permits the 
measurement of interactions between the feed grains and livestock categories, 
where feed grains are regarded as an input and current livestock prices are said 
to influence current livestock supplies. A general formulation is presented in 
(3.2.1). 


(Ya, Zim, = (3, 4, 5, 6, 7) (3.2.1) 


Y production. 

Y 2= price of product. 

Y2 = production of feed grains. 

Y2= price of feed grains. 

Z;. = beginning inventory of livestock. 

Z im= other predetermined factors affecting livestock production including 
range and pasture conditions, technological improvements and the in- 
fluence of past prices. 

uj. =random disturbance. 


3.3.1 The demand relationships for the eleven product categories may be 
classified into three types. The first is termed commercial demand and includes 
the quantities of commodities consumed by persons or industries in the private 
sector of the economy. Such demand relationships appear for all product cate- 
gories. The second is called inventory demand and represents those quantities 
carried over by the private sector at the end of time period t. This relationship 
is fitted for feed grains, wheat, and cotton. The final demand relationship is 
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termed government demand and represents those quantities of commodities 
moving under government price support loan or purchase agreement programs. 
Wheat, feed grains, cotton, and tobacco represent the product categories in- 
volved. 

3.3.2 The commercial demand relationships are of two forms. A derived 
demand exists for feed grains since it is mainly consumed as an input in live- 
stock production. This demand is largely dependent upon existing relation- 
ships between feed grain prices, livestock prices, and the number of animal 
units to be fed. 


(Y 2s, Y i2/Zes, Urs) (i 2, 3, 5) (3.3.3) 


Y2; quantity of feed grains moving into commercial channels. 
Y 2=price received by farmers for i-th commodity. 

Z25= number of animal units fed on farms. 

Ux; = random disturbance. 


For the remaining categories, commercial demand is generally associated 
with (1) the current price of the product, (2) prices of close substitutes, (3) a 
measure of marketing charges, (4) per capita disposable income, (5) the general 
price level, and, in some cases, (6) a time trend to remove some of the effects of 
changing tastes. A formulation appears in (3.3.4). 


(Yis, Ze, Z49, Zn, (¢ = 1,3, 4,---, 11) (3.3.4) 


commercial demand. 

Y 2=price received by farmers. 

Y = price received by farmers for substitute commodities. 

Za = per capita disposable income. 

Zs = index of general price level for United States. 

Z.9=index of average hourly marketing charges for food weighted by the 
wholesaling, retailing, transporting, and processing functions. 

Zs, = linear time trend. 

Uys = unobservable random disturbance. 


Although Zs and Zs. are listed as predetermined variables to this model, 
they are treated as endogenous variables within the Klein-Goldberger master 
model. Disposable income in the master model is equal to the sum of private 
and government employee compensation, nonwage nonfarm income, and farm 
income, minus personal, payroll, and corporate taxes exclusive of transfer pay- 
ments and corporate savings. Annual income figures were divided by corre- 
sponding population estimates to arrive at per capita disposable income. The 
master model was solved for Z,; and the computed, rather than observed, values 
used as the per capita disposable income variable in the agricultural model. 
This derivation links forecasts of the master model with forecasts of the agri- 
cultural sector. Price level estimates are included in the same manner. 

3.3.5 Inventory relationships are estimated for the wheat, feed grains, and 
cotton product categories. The relationships are known to be rather naive but 
represent an improvement over treating the year end carryover as an exogenous 
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variable. In general, carryover is associated with current production plus 
carry in, current prices and expected supply. 


Y x2, Ya*/Zicy Uis) (1, 2, 9) (3.3.6) 


Y «=private inventory carryover. 

Y 2=current price. 
Y = production plus beginning inventory. 
*Zic= expected supply. 

uieg = random disturbance. 


3.3.7 Government demand is defined as the quantity of each commodity 
moving under government loan or purchase agreement, and in the model in- 
volves the feed grains, wheat, cotton, and tobacco product categories. The 
amount of a commodity moving under loan is said to be determined by the dif- 
ference between the national average support price and what the “free” equi- 
librium price would be in the absence of government intervention. 


Yir = ai[Zis — Ya*] + ui’ i = (1, 2, 9, 10) (3.3.8) 


Y,7/ =government demand. 

Y »* = “free market” equilibrium price. 
Ze =national average support price. 
=random disturbance. 


Since the Y* are hypothetical data not observable in the market (3.3.8) can- 
not be estimated; hence a transformation is made for Y j.* in terms of observable 
variables. 


Yio = + Ya* + + Ue = (1, 2, 9, 10) (3.3.9) 


Y = observable market price 
Yjz=government demand. 
Y =available supply. 
Zia=factors on the demand side of the market affecting price. 
uj=random disturbance. 


If Y;7=0 in (3.3.9), Yi2 represents the “free market” equilibrium price. Thus 
- equation (3.3.9), where Y;7=0, is substituted for Y;* in (3.3.8). The resulting 
equation, where all data are observable is presented in (3.3.10). The random dis- 
turbance u;7 is a linear combination of u;7/ and uj. 


(Yix, Ya*/Zie, Zia, Uiz) = (1, 2, 9, 10) (3.3.10) 


4. PARAMETER ESTIMATES OF THE MODEL 


The parameter estimates of the demand and supply relations are given below. 
Standard errors appear below their respective parameter estimates in paren- 
theses. Equations estimated by least squares are designated by LS while those 
estimated by maximum likelihood, limited information are indicated by LISE. 
In the case of LS estimates the coefficient of multiple determination, adjusted 
by the number of degrees of freedom, is presented with the particular equation. 
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Following the estimated relations for each subsystem is a definition of the in- 


cluded variables. 
4.1 Wheat 
Production, estimated by LS with R? =.73 


Yu = — 105.181 + 5.46721; — .370Z + 3.711Z4 + 2.632[Vi2]1 


(3.161)  (.518) (1.076) (.909) 
Commercial demand, estimated by LISE 
Yis = 776.650 — 2.615Yi. — .090Yn — .624Z49 + 1.190Z 4 (4.1.2) 
(1.137) (.163) (. 134) (.294) 
Inventory demand, estimated by LISE 
Yie = 113.619 + .266Yun* — .969V — .099Z15 (4.1.3) 


(.113) (.296) (.315) 
Government demand, estimated by LISE 
Yi7= —585.06+.474Y1* +.107Z21+.386Z 49 — .650Z — .603Z36+2.799Z 16 
(.180) (151) (211) (804) (3.278) (1.252) 
Identity used to complete the subsystem 
Yu + — Zia = Yis + Yie + Yar = Yu* (4.1.5) 


wheat production, in million bushels. 

Yu* =supply of wheat available for utilization, in million bushels. 
Y12.=average market price of wheat to growers, in cents per bushel. 
Yys=commercial demand for wheat, in million bushels. 

Yis=inventory demand for wheat, in million bushels. 
Yi7=wheat delivered to Commodity Credit Corporation, in million 
bushels. 
Z=fertilizer applied in North Dakota and Kansas, in thousand tons. 
Z13=seeded acreage of wheat for the previous year, or announced allot- 
ments when in effect, in million acres. 
Zu=index of weather influence for wheat areas (1943 = 100). 
[¥i2]_1=higher of price of wheat for the previous year or current year sup- 
port price, in cents per bushel. 
Ya = production of feed grains, in hundred thousand tons. 
Z« = per capita disposable income for the United States, in dollars. 
Zs = average hourly marketing cost for food products. 
Zs = acreage planted to winter wheat, in hundred thousands. 
Zs = price level index for United States, 1939 = 100. 
Zs. = national average support price for wheat, in cents per bushel. 
Ys: = beginning inventory of wheat not under CCC loan, million bushels. 
Z4=net exports of wheat, in million bushels. 


4.2 Feed Grains 
Production, estimated by LS with R? =.86 


Z 
ay, 
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= 583.710 — 4.606 Fa + 6.882Zm + 10.987Z.7 


22-3 


(2.248) (.947) (6.395) 
Commercial demand estimated by LS 
Yo3= —3119— . 900 .124V254+1.133 .904Z25+ .271 Y 
(.915)  (.643) (.796) (1.160)  (.570) 
Inventory demand, estimated by LISE 
Yos = — 548.325 — .149V22 + 1.739Za 
(.085) (.307) 
Government demand, estimated by LS with R? = .60 
Yos = 1326.905 + 1.686Z26 — .0335Yx, — 8.434Y 
(.483) (.336) (4.221) 
— .184Y5 — 1.16022, + .796Zn 
(.464) (1.054) (.955) 


(4.2.2) 


(4.2.4) 


Identity used to complete the feed grains product category 


Zu = Yu_, + Yos_, — Zea = Yos + You + Yor (4.2.5) 


4.3 Beef Cattle 
Production estimated by LISE 


Ys = — 10073.262 + .528Y + 129.239Z3,; + 11.090Z;, — 3.502Z2 
4.3.1 
(.645) (66.745) (1.815) (3.213) 
Demand, estimated by LISE 


Ys = 687.789 — .094Ya + 1.000Vs2 + .444Z4 + .547Z 49 


(.025) (.262) —(.693) 


4.4 Dairy Products 
Production, estimated by LISE 
Y 4 =198.188+ .869Y .116Yo2+ .598Z 434+ 1.9697 444+2.425Z 45 
(.212)  (.028) (.285) (.708) (.510) 
Demand, estimated by LISE 
Y =826.594 — Y +.246Z 4 —.096Z 49 — 1.8342 2+ .074Z 46+ .088Z 45 
(.121) (.085)  (.040) (.701) (.026) (058) 


(4.4.1) 


4.5 Hogs 
Production, estimated by LISE 


Yur = — 320.656 + .181 + .344¥2 + .204Z + 12.845Z55 
(.083) (.258) (.448) (3.859) 


- 
(4 .2. 1) 
(4.2.3) 

4 
‘ 

| 

| 

(4.4.2) 
(4 1) 
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Demand, estimated by LISE 
Ye = —371.386— .304 Yu — .059 Y32+68 .959 Yrt 054Z 4 ‘ 513Z 49 


(.221) (.172) (19.230)  (.887)  (.512) 
4.6 Eggs 
Production, estimated by LISE 
Yo = 46.56 — .063Y22 + .669Z6; — .641Z65 + 3.663 Y (4.6.1) 
(.038) (.094) (.183) (1.838) 
Demand, estimated by LISE 
Yeo = 19.258 — .0621Ye + .0524Y 2+ .0493Z, — .0161Z49 (4.6.2) 
(.076) (.058) (.041) (.012) 
4.7 Poultry meats 
Production, estimated by LISE 
Yn = 299.844 — .226Y2 — 646Z65 + 6.329Z73 + 10.427Y 2_, (4.7.1) 
(.039) (.309) (.673) (2.431) 
Demand, estimated by LS with R? =.87 
Yr = 11.32 — .0173¥n + .0077Y + .0107Z 4 (4.7.2) 
(.017) (.002) (.008) 
Y», = production of feed grains in year +1, in hundred thousand bush- 
of feed grains (1910-14= 1000). 


Y23= commercial demand for feed grains, in tens of thousand tons. 
Y»=government demand for feed grains, in tens of thousand tons. 
Y2;=inventory demand for feed grains, in tens of thousand tons. 
Ys: =liveweight slaughter of beef, in million pounds. 

Ys2.= price of beef, in cents per hundredweight. 

Ya =production of milk, in million pounds. 

Y42=price of milk, in cents per hundredweight. 

Ys:=liveweight slaughter of hogs, in tens of million pounds. 

Ys2= price of hogs, in cents per hundredweight. 

Yo: = production of eggs in hundred millions. 

Ye2= price of eggs, in cents per dozen. 

Yn= production of poultry meat, tens of million pounds. 
Y7=price of poultry meat, in cents per pounds. 

Zu =available supply of feed grains. 

Ze = weather index for feed grains (1947-49) = 100. 


Yu __ average weighted price ratio of wheat to feed grains lagged up to 
| three years. 


Zx=animal units fed beginning October 1 year t—1, in hundred 
thousands. 


i 
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= annual average support price for corn index (1910—-14= 100) 

Z2;=fertilizer nutrients applied in North Central States, in hundred 
thousand tons (square root of variable taken). 

Zs = index of range condition for May—October, 100 = normal. 

Z3s= January 1 inventory of steers and calves, in tens of thousands. 

Zs = general price level in United States, in 1939 constant dollars. 

Za = per capita disposable income in United States, in dollars. 

Z=ratio of milk used for manufactured vs. fluid uses. 

Zs3=number of Dairy Herd Improvement Associations operating on 
January 1, X10. 

Zu = pasture condition as a per cent of normal. 

Zy;= January 1 inventory of cows and heifers, two years old or over, in 
hundred thousand head. 

Z=net exports of milk plus net inventory change, in tens of million 
pounds. 

Z43= production of oleomargarine, in million pounds. 

Zs = average hourly marketing charge for food products, weighted by 
the retailing, wholesaling, transporting and processing functions, 
in tenths of cents per hour. 

Zss=January 1 inventory of sows and gilts, in hundred thousands. 

Ze3=number of birds in National Poultry Improvement Plan, in 
hundred thousands. 

Zes= January 1 inventory of hens and pullets, in millions. 

Z73= index of physical units of poultry equipment on farms in hundred 
thousands. 

[ Yeo ]_1=price eggs on December 15 of previous year, in cents per dozen. 
[Yz2|1=price of broilers during previous year, in cents per pounds. 


4.8 Soybeans 
Supply, estimated by LS with R?=.90 


Y 
Ya = — 1513.726 + 7.545 Fa + 3.394Zs; + .656Zs4 
(4.098) (.280) (.502) 


Price, estimated by LISE 
Y = — 146.636 —.0725Y 55+ .0345Y 93’ + .2218 Y 94 +.0598Z25+.1919Z 4 
(.0183) (.0157) (.0640) (.0283)  (.0722) 
Yss=aYu — .0082Z 35 
(.0474) 


(4.8.2) 


Ys: = production of soybeans, in hundred thousand bushels. 
Ys.=price of soybeans, in cents per bushel. 
Ys;= amount of soybeans crushed. 


Fa = price ratio of soybeans to corn for the previous year. 
22j—1 


1 
‘ 
iG 
j 
4 
€ 
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Zs3 =number of combines on farms, in thousand units. 
Zs = index of weather for soybeans. 
Zs = general price level in 1939 constant dollars. 
Za =per capita disposable income, in dollars. 
Zws=animal units fed beginning October 1, year ¢—1, in hundred 
thousands. 
Yo; = price of cottonseed oil, in cents per pounds. 
Y 9’ = price of cottonseed meal, in cents per ton X10~. 
a=per cent of total production which was crushed for the previous 
year. 
4.9 Cotton 
Supply, estimated by LS with R?=.70. 


Yo. = 10892.674 + 2.622[Zo10|-1 + + 71.248Z01 
(.597) (1.572) (28.299) 


— 20.585Z9 — 87.771Z9s 
(4.787) (72.828) 
Commercial demand, estimated by LISE 
Yos = 11387.812 — 12.446 + .112Z 95 + 9.01424 — 10.169Z (4.9.2) 
(9.029) (. 162) (3.958) (2.909) 
Inventory demand, estimated by LISE 
Yos = 637.060 — 4.096Y + .295Ya* — .393Z95* (4.9.3) 
(4.469) (. 156) (1.239) 
Government demand, estimated by LISE 
Yor = — 7020.920+  .610Yo* + 1.599Z95* + 1.310Zo9 
.150 1.424 3.601 
(12.770) (4.998) (4.064) 


identity to close the subsystem 
Ya* = Yo + Zoe + Zor = Yos + Yoo + Yor + Zor + [Zor] it (4.9.5) 


Y 9: = production of cotton, in thousand running bales. 
Y,* =available supply of cotton, in thousand running bales. 

Y 92 = average price of cotton to growers in tenths of cents per pound. 
‘ Yo5= mill consumption of cotton, in thousand running bales. 

Yo =ending inventory of cotton not under loan, in thousand running 

bales. 

Yo97=government demand for cotton, in thousand running bales. 
Za=per cent of cotton grown in Far West, acres as per cent of 1000. 
Zo = cotton yield decline due to climatic factors, as per cent of 1000. 


he 
it: 
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Z 3 = prices paid for production inputs by cotton growers, 1947-49 = 100. 
Zo,=net exports of cotton, in thousand running bales. 
Zo;* =foreign supply of cotton, in tens of thousand running bales. 
Zs = private stocks of cotton on July 1, in thousand running bales. 
Zs1=CCC inventory of cotton on July 1, in thousand running bales. 
Zog= average support price for cotton, in tenths of cents per pound. 
Zs9= production of synthetic fibers, in million pounds. 
Zs = general price level, in 1939 constant dollars. 
Za=per capita disposable income, in dollars. 
[ Ye ]_1=higher of support price or market price for previous year, in cents 
per pound 
[Z>,10]_,=higher of cotson acreage allotments or last years acreage grown, in 
tens of thousand acres. 


4.10 Tobacco 
a. Flue cured tobacco 
Production, estimated by LS with R?=.80 


= 318.459 + 1.589Z10,4 + .228Z101 — .207Z.0,7 
(.187) (. 162) (.113) 
Commercial demand, estimated by LISE 
VY 10,6 = 2451.793 — 15.024 Vio 2 +2.611Z 4: — 5.23221, 12+ 173.382Z 10,6 
(12.833) (2.098) (4.360) (154.053) 
Government demand, estimated by LISE 
Yi0,4 = 2750 —3.561 Y10,1+2.0397Z 10,3 + 1.0662 10,12 4-8.509Z —5.823Z 56 
(2.311) (5.940) (2.590) (3.419) (2.396) 


b. Burley tobacco 
Production, estimated by LS with R?=.83 


Yioa’ = — 12.260 + + 3.189Z10,6 + 
(.047) (.029) (.014) 
Commercial demand, estimated by LISE 
Y 10,3’ = 765.698 — 1.6735 10,2’ +.360Z 41 — 2.012 10,19’ +21.048Z 10,6 
(.507) (.148) (.612) (8.337) 
Government demand, estimated by LISE 
Yi0,4 = —314.265+.791 —.199Z +.610Z 10 12’ +.183Z 10,3’ 
(.129) (.073)  (.198) (.146) 
Identities to complete the systems 
Yioa = + 
Yio.’ = Yiox’ + 


(4. 


(4.10.1) 
(4.10.2) 
(4.10.3) 
(4.10.4) 
(4.10.5) 
10.6) 
(4.10.7) 
: (4.10.8) 
‘ 
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production of flue cured tobacco, in million pounds. 
Y10,2= average price of flue tobacco, in tenths of cents per pound. 
Yio.4=government demand for flue tobacco, in million pounds. 
Y10.=commercial demand for flue tobacco, in million pounds. 
Yo,’ =production of burley tobacco, in million pounds. 
Y 10,2’ =average price of burley tobacco, in tenths of cents per pound. 
Yio,3’ = commercial demand for burley tobacco, in million pounds. 
Yio.’ =government demand for burley tobacco, in million pounds. 
Z1o.1=higher of acreage allotments or previous years acreage harvested 
for flue tobacco, in thousand acres. 
Z.0,3 = average support price for flue cured tobacco in tenths of cents per 
pound. 
Z10.4= higher of announced support price or previous years market price 
- for flue cured tobacco, in tenths of cents per pound. 
linear time trend, 1929=1. 
Z1o,7= June rainfall in Raleigh, North Carolina, in hundredths of inches. 
Z1o,.2=ratio of manufacturers stock to disappearance for flue cured 
tobacco. 
Z'\o.2=higher of acreage allotments or last years acreage harvested of 
burley tobacco, in thousand acres. 
Zi 3 = average support price for burley tobacco, in tenths of cents per 
pound. 
Zo 12= ratio of manufacturers stocks to disappearance for burley tobacco. 


[Yio.2]_,.=higher of support price or market price for previous year for 


burley tobacco, in tenths of cents per pound. 
Zs = general price level, in 1939 constant dollars. 
Z.,=per capita disposable income, in dollars. 


4.11 Vegetables 


Fresh vegetables 
Production, estimated by LS with R?=.93 


Yua = 605.543 + + 2.546Zn1 
(.497) (.422) 
Demand, estimated by LS with R?=.91 
Yue = 235.345 + 1.010Z4 — .040Yua — .276Z 
(.135) (.245) (.082) 


(4.11.1) 


(4.11.2) 


b. Processed vegetables 


Production, estimated by LS with R?=.71 
= 46.120 + 6.991[Yiv ola + 1.693¥ 
(2.600) (.755) 
Demand, estimated by LS with R?=.97 
= 46.793 + .484Z4 — .011Yiv — .134Z 49 
(.050) (.007) (.028) 


(4.11.3) 


(4.11.4) 


WAS 
= 
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Yu,1=production fresh vegetables, in thousand tons. 
Yu.2=price fresh vegetables, in tenths of dollars per ton. 
= production processed vegetables, in thousand tons. 
Yw .2=price processed vegetables, in tenths of dollars per ton. 
Z1,1= acres harvested of fresh vegetables for the previous year, in thousands. 
Zw 1 = production of processed vegetables for the previous years in thousand 
tons. 
Za=per capita disposable income, in dollars. 
Z4g = average hourly marketing cost, in tenths of cents per hour. 


5. AGGREGATIVE ECONOMIC MEASURES 


The model is constructed with the purpose of aggregating economic measures 
of agricultural activity. Prices received by farmers are estimated for each prod- 
uct category. From these price series is constructed a fixed-weight aggregative 
index of prices received by farmers for the products included in the eleven 
product categories. The construction of the index is done in the following 
manner 

(Via) = (1,2,---, 1) 


Py (5.1) 


= (1929, 1930, - , 1953) 


P,,=index of prices received for year f. 
(Y 2):=price received for category 7 during time period ¢. 
(¥iq)o= average quantity of category 7 sold during base period 0. 
(Y 2)o=average price received for category 7 during base period 0. 
Base weights represent the period 1937-41. Similarly, an index of physical 
production is derived as in (5.2) 


(Ya)o(Ya)o = (1929, 1930, - - - , 1953) 


Q: (5.2) 


(Ya):=production of category 7 during time period ¢. 

(Yi1)o=production of category 7 during base period 0. 

Base weights represent the period 1947-49. 

Given the prices received by farmers and the production for each product 
category, gross farm income R is estimated as follows: 


11 
R= (Va: Ya) (5.3) 
i=1 
R=gross farm income from production. 
ri2=farm income from category twelve. 

and are defined above. 

To arrive at estimates of farm production expenses for each product category 
it is necessary to have data on the distribution of production inputs and prices 
paid for them by product categories. Such data are fragmentary and insufficient 
for the purpose. Consequently, aggregate production expenses are estimated. 
Production inputs may be classified by source into four general categories. 
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(a) farm expendables, 

(b) non-farm expendables, 

(ec) non-farm durables, and 

(d) services, including taxes, interest and wages. 

Ideally, the construction of an index of prices paid by farmers for items used in 
production would involve quantity weights, and commodity price data gener- 
ated by a set of structural relations describing the agricultural and non-agri- 
cultural sectors of the economy as a unit. Since such a model is not available, 
the price data used in constructing the index of prices paid are considered to be 
endogenous for the category of farm expendables and exogenous for the remain- 
ing three input categories. 

Production expenses are, of course, the physical quantities of inputs or serv- 
ices multiplied by their respective prices. 

E = (P,-1) (5.4) 
E=dollar value of production expenses, in million dollars. 
P,,=index of prices paid by farmers for items used in production, 1910-14 
= 100. 
I =physical quantities of inputs used in production. 

In the absence of data on J and the corresponding prices included in P, an 
estimating equation for E is constructed substituting the index of physical 
production, Q, for J. The logic of the procedure is the assumption that Q is a 
direct function of J. 

Q = bi or, (5.5 
I = (5.6) 
The expression in (5.6) is substituted for J in (5.4) which is fitted by a least 


squares procedure. Since there appears to be no reason for forcing the regression 
line through the origin, a constant term was included to give a linear fit. 


E=a+t+b"(Q:P,) =u (5.7) 
Least squares estimates are given in (5.8) where R?=.98 


E = 788.700 + 7.2867(Q — P,) 


5.8 
(2.1695) 

Net farm cash income is estimated as the difference. 
R-—-E=WN (5.9) 


The estimates of N could be made comparable with Department of Commerce 
figures on net income of farm proprietors by adding to N income from addi- 
tional sources. These include government payments made directly to farmers, 
gross rental value of farm dwellings, and the net change in farm inventories. 


6. EVALUATION OF THE MODEL 


6.1 One way of evaluating the theory and methods of estimation in the 
econometric model is to compare forecasts outside of the sample period with 
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subsequently observed values of the same variables. The model is tested for the 
years 1954 and 1955 by predicting the values of all endogenous variables and 
using these to construct the aggregative measures outlined in the preceding 
section. 

Table 571 presents the predicted and observed values of the aggregates for the 
years 1954-1955. 

The prices of almost all products are over-estimated in the prices received 
index. This is primarily due to the inclusion in the demand equations of price 
level variables which are also over-estimated in the Klein-Goldberger model. 


TABLE 571. PREDICTED AND OBSERVED? ESTIMATES OF AGGREGATES 
1954, 1955 


1954 1955 
Aggregate 
Predicted Observed Deviation | Predicted Observed Deviation 


Prices rec. 
(1937-41 = 100) 260 234 249 + 27 
Production 
(1947-49 = 100) 107 110 107 —- 4 
Prices paid 
(1910-14 = 100) 308 286 296 ¢ + 6 
Prod. expenses 
(mil. dol.) 23 , 225 21,410 +1,815 22 ,289 21,570 +719 
Gross income 
(mil. dol.) 35,980 31,397 +4,583 30 ,623 31,032 —409 
Net income 
(mil. dol.) 12,755 9,987 +2,768 9,053 9,462 — 409 
2 Observed estimates will differ from official published estimates due to fewer commodities being included in 
the model estimates, different weighting procedures and the exclusion of inventory changes. 


In the aggregate index of prices received, the major part of the deviation is 
caused by over-estimating cattle prices and cotton prices. Correct estimates for 
these two would have reduced the deviation to seven index points in 1954, and 
correct estimates for cotton alone in 1955 would have reduced the deviation to 
12 points. The direction of change is correct between the two years and the 
percentage reductions in the predicted and observed indexes of prices received 
are almost identical. Physical production is slightly under-estimated which 
also causes an over-estimation of prices received by farmers. It is likely that 
similar circumstances cause deviations in all predicted aggregates, i.e. an over- 
estimation of feed grain prices, the consequent under-estimation of beef pro- 
duction and the accompanying over-estimation of beef prices. 

Feed grain prices and cattle prices cause the over-estimation in the prices 
paid index. The over-estimation of gross farm income is a result of the over- 
estimation of prices received by farmers. The under-estimation of production is 
not great enough to balance the error in prices when computing income. The 
same is true of net farm income which is calculated as the difference between 
gross income and production expenses. 

The above process of prediction is purely mechanical and it is believed that 


; 

| 

> 
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fairly reliable point forecasts can be made if external knowledge is also relied 
upon during the estimating process. 

6.2 A second means of evaluating the model is to estimate point elasticities 
and compare these with other studies for consistency and plausibility. Those 
values which appear to be widely divergent from “accepted” elasticities are 
evaluated for possible sources of error. Table 573 includes the estimated elastici- 
ties, computed at their mean values. 

The estimated income elasticity for wheat is obviously too large. Even allow- 
ing for all population increases in the per capita income variable, one would 
expect the elasticity to have a value of 1.0 as a maximum. The demand rela- 
tions for flue cured tobacco appear to have large errors associated with them and 
reservations should therefore be placed upon the elasticities computed from the 
relations. 

In general, the price elasticities of supply appear to be more reasonable than 
do the demand elasticities. 

6.2 Two further statistical tests were conducted although both have limita- 
tions when applied to relations estimated by the limited information, maximum 
likelihood procedure. These are the Durbin-Watson test for serial independence 
of the disturbances and the ¢ test to determine if the estimated parameters dif- 
fer significantly from zero. 

Of the thirty-five estimated relations, eighteen were such as to accept the 
hypothesis that there was serial independence of the disturbances. In sixteen 
cases the test was inconclusive and in one case the hypothesis of serial inde- 
pendence was rejected. 

While the ¢ test has been developed for least squares estimates, it also forms a 
rough measure of the reliability of parameters computed by the limited in- 
formation technique. By observing whether the ratio of the parameter to its 
standard error is greater than 1.72, a rough indication of the reliability of the 
parameter estimates of the model is possible. Of the 134 parameter estimates 
81 were significantly different from zero at the 5% probability level using a 
one-tailed test. 

An alternative to the ¢ test is to retain parameters whenever the parameter 
sign supports the logic used in constructing the equation, even though a sta- 
tistical test may not reject the hypothesis that the parameter equals zero. This 
procedure is followed because the ¢ test is not designed for limited information 
techniques, and more importantly, while a parameter estimate may not appear 
to be significant, the cause may be errors of observation, improper functional 
form, or the presence of collinearity between supposedly independent variables, 
instead of the irrelevance of the variable. 


7. SUMMARY 


The primary purpose of the estimated model is to measure major relation- 
ships within agriculture and between agriculture and the remainder of the 


3 Preliminary reformulations of the model for wheat indicate an i lasticity of d d equal to 0.22 
and a price elasticity of demand approximating —0.26. In the reformulaticn a separate demand equation for wheat 
fed to livestock is estimated rather than combining it in the total food demand as does the current model. 


P 
od 
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TABLE 573. ESTIMATES OF ELASTICITIES 


Variable of Price elasticity Price elasticity |Income elastic-| Price flexibil- 
Normalization of supply | of demand ity of demand | ity of income 


Wheat 
Production 0.370 
Commercial demand 
Inventory demand 
Government demand 


Feed Grains 
Production 
Commercial demand 
Inventory demand 
Government demand 


Beef Cattle 
Production 
Price 


Dairy 
Production 
Price 


Hogs 
roduction 
Price 


Poultry Meat 
Production 
Price 


Eggs 
roduction 
Price 


Soybeans 
Production 
Price of oil 
Price of meal 


Cotton 
Production 
Commercial demand 
Inventory demand 
Government demand 


Flue Tobacco 
Production 
Commercial demand 
Government demand 


Burley Tobacco 
Production 
Commercial demand 
Government demand 


Fresh Vegetables 
Production 
Price 


Processed Vegetables 
Production 0.416 
Price —5.714* 1.510 


* Indicates reciprocal of price flexibility of demand. This approximates the price elasticity of demand. 


— = 
—0.518 
| 0.601 1.426 
oe | 3.677 
| | 
| —1.776 
5.934 
| —0.752s 0.311 
—0.674° 0.782 
—2.370° 0.038 
0.395 
0.298 
—1.036 1.409 
5.830 
—0.483 
0.361 
—0.300 0.953 
We —0.211 
1.252 
0.516 
es. | | 2.674 | | 
0.381 | 
—1.325 | 0.767 | 
1.553 | | 
| 
| | 1.706" 1.684 
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economy. Its usefulness may only be determined by its repeated application as 
a forecasting device. Such forecasting may be done at two levels—the product 
or category level where producer decisions are made and the national aggregate 
level where policy and interindustry effects are felt and appraised. One of the 
advantages of the model is its susceptibility to the construction of multipliers. 
It is possible in this way to trace secondary effects through particular subsys- 
tems, or to observe the effects of non-agricultural variables on agriculture as 
they enter through the Klein-Goldberger model. 

No attempt is made to develop specific areas of applicability. The areas of 
producer outlook, policy formation, interindustry studies and consumer in- 
formation all appear to be fertile areas in which application could be made. 
Testing, reformulating and extending the model in such areas are necessary if 
it is to become a useful research and extension tool. 
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A NOTE ON THE RELATIONSHIP BETWEEN EARNING 
EXPECTATIONS AND NEW CAR PURCHASES 


Peter E. pE JANOosI 
Standard Oil Co. (N.J.) 


Is the probability of a new car purchase greater among those spending 
units that expect to make more money than among those expecting 
to make less? Surprisingly, in a sample of American spending units 
those expecting to make less actually turned out to be the more fre- 
quent purchasers. Exceptionally high incomes, special trade-in habits 
and incorrect expectations are suggested as possible explanations for 
this paradoxical finding. 


1, INTRODUCTION 


YPICALLY, in studies of consumer demand for a particular commodity, or 
ae of commodities, a large number of variables have been found im- 
portant. Such variables include income, price, and in the case of durable goods, 
a stock variable. More recent demand studies have employed expectational or 
attitudinal variables based on cross-section data. With the help of the expecta- 
tional variables the likely demand for selected commodities can, in some in- 
stances, be assessed more accurately than is possible otherwise. Supporting 
evidence has been published widely by numerous authors.' However, the pres- 
ent note describes one case where curious results were obtained by the use of an 
expectational variable. 


2. THE FINDING 


The variable discussed here deals with the relationship between earning-rate 
expectations and new car purchases.” Specifically, the question raised is this: 
are families that expect to make more money a year hence than a present more 
likely to buy a new car during the intervening period than other families? On 
the basis of a priori reasoning as well as other automobile demand studies, we 
would expect the answer to be in the affirmative. 

The question posed can be examined empirically with data collected by the 
Survey Research Center for the Federal Reserve Board. For the year 1952, 
two sets of interviews of identical spending units are available. The first round 
of interviews was conducted during January and February 1952, with the 
second round following a year later. The sample of respondents is not a repre- 
sentative sample of American spending units, but this is not likely to affect the 
finding.* 

The question concerning expected earning rates was asked in January—Feb- 


1 For example: L. R. Klein and J. B. Lansing, “Decisions to Purchase Consumer Durable Goods,” Journal of 
Marketing, 20 (1955) 109-32; E. Mueller, “Effects of Consumer Attitudes on Purchases,” American Economic 
Review, 47 (1957) 946-65; G. Katona and E. Mueller, Consumer Attitudes and Demand 1950-1952, Survey Research 
Center, University of Michigan, 1953. 

2 This note is based on the author's Factors Influencing the Demand for New Automobiles; A Cross-Section 
Analysis, unpublished Ph.D. dissertation submitted to the University of Michigan, 1956. Apart from the case 
mentioned here, the overall study found strong evidence of the usefulness of expectational and attitudinal variables. 

4 For further details about the data, see de Janosi, op. cit., Ch. 2. 
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TABLE 576a. PROPORTION OF NEW CAR PURCHASERS WITHIN 
EXPECTED EARNING RATE GROUPS 


Spending Units That Expect to Make 


More Money a Less Money a 
' Year Hence The Same Year Hence 
Proportion of New Car Buyers -08 .07 .18 
Number of Cases 324 425 71 


ruary 1952 and reads as follows: “How about a year from now—do you think 
you people will be making more money or less money than you are now, or 
what do you expect?” A year later the same respondents were asked, among 
other things, whether they had bought a new car during 1952. 

Table 576a shows the proportion of spending units that purchased a new car 
within different earning-rate expectations groups. The differences in the pro- 
portions are substantial. A chi-square test results in a x? of 9.58 which is sig- 
nificant at the .01 level. However, contrary to expectations the proportion of 
new car purchasers is largest among those expecting to make less. What possi- 
ble explanation, apart from chance occurrence can be found for this unusual 
result? 

Since the su:"ri-ing element in Table 576a is the high frequency of new car 
purchases in t tive expectations group, the procedure to be followed will 
be to examine in uunute detail the 13 (18 per cent of 71) respondents in that 
group. These few spending units “cause the problem.” 

The first striking fact about these 13 spending units is that 6 had a disposable 
income of more than $15,000, 6 had a disposable income between $5,000 and 
$10,000, and only one had less than $5,000. A comparison with the income 
distribution of the total sample and the income distribution of all new car 
purchasers reveals (Table 576b) that unusually many of the 13 spending units 
were enjoying higher incomes. It is perhaps reasonable to assume that the 
higher-income units could afford to ignore their earning-rate expectations in 
making automobile purchases. 

A tentative explanation, related to the fact that these spending units belong 
to the higher-income brackets, is that all but one spending unit traded-in a car 
of recent vintage on the new car purchased in 1952. This suggests that these 
people might trade-in a car at regular intervals regardless of their particular 
needs and economic situations. 


TABLE 576b. DISTRIBUTION OF ALL SPENDING UNITS AND NEW 
CAR PURCHASERS WITHIN DISPOSABLE INCOME GROUPS 


Under | $5,000- | $10,000-| $15,000 Total Number 
$5 ,000 9,999 14,999 | and over ota of Cases 
| 
All Spending Units 03 1.00 820 
All New Car Purchasers) .42 | .35 .09 1.00 69 
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Yet another approach is to determine whether the 13 spending units’ earning 
expectations were correct. Of the 6 with incomes above $15,000, 4 made less 
(their expectations were correct), and 2 made more (their expectations were 
incorrect). Of the 7 with incomes under $10,000 only one made less; the others 
had unexpected increases in earning rates. Hence, the unanticipated increase in 
earning rate may have induced some of these spending units to purchase a new 
car.‘ 

3. CONCLUSION 


In conclusion, it seems clear that by a detailed examination of the data sev- 
eral possible explanations can be found for the unusual finding presented in 
this note. The important point, however, is that this expectational variable 
creates serious problems in a study of factors influencing the demand for new 
automobiles, and may do so in the case of other goods also. 


4 Of those expecting to make the same, about 41 per cent were incorrect in their anticipation, and made more. 
The frequency of new car purchasers in this group is also higher than among the others expecting no change in 
earning rate. 
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THE ACCURACY OF CENSUS LITERACY STATISTICS IN IRAN* 


CHARLES WINDLE 
Pacific State Hospital 


The accuracy with which Iranians’ literacy can be obtained from a 
census was checked against later test performance in a sample of 464 
Iranian Oil Company drivers. Mean square contingency coefficients of 
.64 and .63 were found between test results and self-reported literacy in 
English and Farsi. A relatively low level of skill was sufficient for 
Tranians to classify themselves literate in Farsi, but a higher level of 
skill was required for them to classify themselves literate in English. 


1. INTRODUCTION 


TATISTICS on literacy are usually obtained from national population censuses 
S in which enumerators classify individuals’ responses. Unfortunately, such 
data are subject to great unreliability, primarily because of difficulties and lack 
of uniformity in defining literacy.' In addition, especially in undeveloped coun- 
tries where census purposes are poorly understood, answers to census questions 
are likely to be influenced by suspicion and desire for prestige.? 

There have been several instances in which population literacy estimates 
have been based upon tests of reading and writing ability.* Generally, this ob- 
jective approach yields more reliable and more detailed information than does 
simple census declaration, but it is too expensive to be generally applicable. A 
recent study has suggested that the meaning of census literacy data can be 
ascertained by testing a sample of the enumerated population.‘ The present 
study is an attempt to check the accuracy of census information on the literacy 
of Iranian oil industry employees by using the technique of post-enumerative 
testing. 

2. METHOD 


In a census of all Iranian employees of the Iranian Oil Exploration and Pro- 
ducing Company during the winter of 1955, Iranian interviewers asked each 
employee whether he could speak, read, and write Farsi, the national language 
of Iran, and English, the language used for most official business within the 
Company. Census interviewers themselves filled in the census forms, classifying 
employees according to their responses as: 

0—None (does not speak, read, or write the language). 

1—Speaks only (speaks, but does not read or write the language). 

2—Reads and writes only (reads and writes, but does not speak the language) 

3—Speaks, reads, and writes. 
Reinterviews of 250 employees after 2-4 weeks (by different interviewers in 
four-fifths of the cases) revealed disagreement of 6.0% over all classifications 
on both the English and the Farsi literacy questions. 


* The views expressed in this article are those of the author und do not represent the policies of the Iranian Oil 
Operating Companies. 

1 UNESCO, Progress of literacy in various countries. Paris, 1953. 

* Crary, D. D., “The villager” in S. N. Fisher (Ed.), Social forces in the Middle East. Ithaca, N. Y.: Cornell 
University Press. 1955. 43-59. 

* See UNESCO, World illiteracy at mid-century. Paris. 1957. 23-7. 

4 Zarkoviec, 8. 8., “Sampling control of literacy data.” J. Amer. stat. Assn., 49, 1954, 510-9. 
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Six months after the census 464 Iranian Company drivers were given a brief 
individually-administered test to determine their literacy more accurately. This 
test consisted of two series of cards, one series in Farsi and one in English, on 
which were written clearly by hand: 

1. Four three- and four-digit numbers. 

2. Three simple words or phrases such as “read,” “extra,” and “How much?” 

3. Two simple sentences such as “Your attention is required.” 

4. One complex sentence. 

Subjects were required to read the material on these cards aloud. Numbers 
could be read digit by digit. Interpretations of the meaning of material were 
requested only when reading was labored or words were mispronounced. In 
addition, on the first two cards subjects had to write two three-digit numbers 
and two simple words, respectively. Subjects were considered adept at that 
level of skill at which they made no more than one mistake. 


3. RESULTS 


The census and test results of the Company drivers are shown in Tables 
580a and 580b. 

1. Farsi Literacy. Since all drivers spoke Farsi, there were only two classes 
of census response—“speaks only” and “speaks, reads, and writes.” The census 
classification of literacy was fairly highly correlated with test performance. The 
mean square contingency correlation between claims of Farsi literacy and test 
performance was .63, corrected for broad categories by using four and ten 


categories for triangular distributions.’ 

Generally, a very low level of skill was sufficient for employees to classify 
themselves literate. Only 81% of the number claiming to be literate could be 
said to be able to read and write even simple words (that is, 25.2%/31.0%). An 
even lower per cent (67%) of those who claimed to be literate achieved this 
level of skill. On the other hand, almost twice the number claiming to be 
literate can read and write numbers (that is, 59.7%/31.0%). 

By knowing the per cent of subjects at each level of test performance who 
claim to be literate, and assuming that these drivers do not differ from other 
groups in their interpretation of the meaning of literacy, it is possible to predict 
the test performance of other groups for whom census data are available. 

b. English Literacy. The position of driver theoretically requires reading and 
filling out trip tickets in English. In fact, instructions are oral and trip tickets 
are usually filled out by special clerks. 

Since only eleven drivers claimed to be able to read and write English and 
all the others claimed no ability in English, the drivers fall into two very un- 
equal groups (see Table 580b), which are too limited to be more than suggestive. 
Again, there is a significant relationship between test performance and census 
classification of literacy. A mean square contingency coefficient of .64, cor- 
rected for broad categories by using four and three categories for one triangular 
distribution,® was obtained. 


§ Peters, C. C. and Van Voorhis, W. R., Statistical procedures and their mathematical bases. New York: McGraw- 
Hill. 1940. 398-9. 
Ibid. 
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TABLE 580a. FARSI LITERACY TEST PERFORMANCE OF CENSUS 
LITERATES AND ILLITERATES 


Per cent of 144 | Per cent of 320 Per cent of 
Test performance self-reported self-reported total 
literates illiterates (N =464) 


Read and wrote complex sentences 10.4 
Read and wrote simple sentences 23.6 
Read and wrote simple words 32.6 
Read and wrote numbers only 29.2 
Could not read and write at all 4.2 


Total 


With English a higher level of skill seemed necessary for employees to classify 
themselves literate. More were found able to read and write simple words or 
sentences than claimed to be able to read and write (133% or 3.2%/2.4%). 
About 24 times as many drivers were able to read and write numbers as claimed 
to be able to read and write (57.39%/2.4%). It should be observed that almost 
as many drivers were able to read and write numbers in English as in Farsi. 
This is probably a result of their constant exposure to English numbers on the 
job; all numbers on the instrument panels of the European and American 
vehicles used in the Company were in English. 


TABLE 580b. ENGLISH LITERACY TEST PERFORMANCE OF CENSUS 
LITERATES AND ILLITERATES 


‘ Per cent of 11 Per cent of 453 Per cent of 
Test performance self-reported self-reported total 
literates illiterates (N =464) 
Read and wrote words or sentences 72.7 1.5 3.2 
Read and wrote numbers only 9.1 55.2 54.1 
Could not read and write at all 18.2 43.3 42.7 
Total 2.4 97.6 100.0 


Because these subjects differed in their tendencies to exaggerate their claims 
to literacy in Farsi and in English, test performance in one language could not 
be predicted accurately from knowledge of census results in both languages and 
test performance in the other language. 


4. DISCUSSION 


Zarkovic’ concluded from his sampling study following a Yugoslav census 
that literacy is a continuous variable. Ignoring differences among ability to 
write, to read written symbols, and to translate these symbols into meanings, 
there are still gradations of literacy which will be difficult to define or to differ- 


7 Op. cit. 
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entiate. Even with the use of tests it will be difficult to obtain comparability 
among languages. In the present study the use of numbers as a fundamental 
stage of literacy is an innovation which may be of value in future research. 
Other gradations of literacy may be defined in terms of frequency of usage of 
words in various contexts. Such reference points for tests would enhance the 
utility of studies which attempt to estimate population literacy from objective 
test performance. 

It is interesting that in neither the present study nor that by Zarkovic was 
there a marked tendency for respondents to exaggerate their literacy. It is pos- 
sible that the least exaggeration occurs under conditions of low literacy. This 
appeared to be the case in contrasts between rural and urban respondents in 
Zarkovic’s study and between a foreign and the local language in the present 
study. Part of this difference may be due to varying interpretations by inter- 
viewers. 


® Lorge, I. and Thorndike, E. A semantic count of English words. New York: Institute of Psychological Re- 
search, Teachers College, Columbia University. 1938. 
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SOURCES OF STATISTICS ON CRIME AND CORRECTION* 


Ronavp H. Beatriz 
California Department of Justice 


Sources of statistics on crime and correction are to be found with the 
aid of both generai bibliographies and a series of special bibliographies 
in this field. The primary sources are the official reports of federal, 
state, and local agencies engaged in the various aspects of law enforce- 
meut, criminal justice, and corrections. Many special surveys have 
been made over the past 40 years supplying valuable data on crime. 
Some of these relate to particular problems, such as, homicide, nar- 
cotics, sex offenders, parole, and juvenile delinquency. Crime informa- 
tion, today, still comes from many scattered sources and lacks 
uniformity. More useful statistics will be available when each state 
assumes responsibility for developing reliable and complete crime sta- 
tistics within its own borders in accordance with accepted uniform 
classifications. 


BRIEF review of sources of criminal statistics in the United States can only 
A outline the general perspective of available statistical data and touch on 
the major bibliographical compilations and the types of studies that provide 
data in these fields. There are many scattered sources of information relating 
to limited geographical areas or specialized aspects of crime and correction, 
but there is very little to aid in really determining the nature and extent of 
crime or the distinguishable characteristics of criminal offenders and what is 
being done about them. 

Statistical information on crime, and corrections or on any of the various 
phases of criminal law enforcement in the United States is much more limited 
than most persons realize. In the first place, there is no official national source of 
data on crime and corrections that covers all of the activity of criminal law en- 
forcement. Under the United States Constitution the police powers were re- 
served to the states. Consequently, there are 49 independent and sovereign 
criminal jurisdictions among the states, with another jurisdiction in the District 
of Columbia (which has a code of criminal law and procedure enacted by 
Congress), and still another arising from the law enforcement functions per- 
formed by several of the Federal Government agencies. Except within federal 
reservations the latter do not encompass the usual crimes that are the subject 
of the general police powers of the states, but rather involve offenses arising 
out of primary functions of the Federal Government such as: taxation, and 
foreign and interstate commerce. 

Information as to what occurs, in terms of the number of crimes committed, 
the number of persons arrested, the prosecution and disposition of offenders, 
the number committed to correctional processes, and the handling of prisoners 
in correctional processes must come primarily from state sources. There is a 
great deal of similarity to be found among the states in their substantive 


* This paper was prepared to supplement statistics on crime and correction in the forthcoming new edition 
of Historical Statistics of the United States compiled under the joint sponsorship of the Bureau of the Census and the 
Social Science Research Council. 
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criminal law and in their criminal procedure, as most of this was derived from 
the common source of English common law in effect in the American colonies. 
Nevertheless, there are many differences to be found from state to state in the 
specific definitions of crimes, in the administrative processes related to carrying 
out the responsibility of law enforcement, prosecution, and corrections, and in 
the penalties that are provided for the commission of the various crimes. 
Further, it is basic in the American form of democratic organization that law 
enforcement shall be primarily a local matter even though the laws governing 
crime and procedure are statewide. This means that, to a large extent, the 
sources of statistical information relating to crime, criminal justice, and cor- 
rections are to be found in the records of local police departments, local courts, 
and local correctional agencies. 

Until relatively recent times there has been little effort to collect summary 
information on crime within the borders of a single state, much less to put 
together an over-all picture of crime within the United States. It is only in the 
last 30 years that an active interest in such developments has produced some 
sources of summary information in this field. These developments will be dis- 
cussed later. 

Consideration of source materials in the field of criminal statistics will be 
made under the following five headings: (1) bibliographies; (2) official or gov- 
ernmental reports; (3) other general surveys; (4) sources of information on 
specialized problems in the field; and (5) individual authorities. 


1. BIBLIOGRAPHIES 


The need to organize sources of information on the very broad problem of 
crime and criminal behavior has led to the development of a good many 
bibliographical compilations in this field. Nearly every text book on criminology 
will supply a series of bibliographical references at the end of each chapter. 

A most comprehensive bibliographical survey was made by Augustus Kuhl- 
man of the University of Missouri under the asupices of the Social Science 
Research Council. This survey resulted in the publication of Guide to Material 
on Crime and Criminal Justice [7] which contained over 13,000 references. 
This material organized in great detail and covering the most significant items 
appearing in the literature of this country up through the year 1927, made 
available for the first time a ready source of most of the information then to 
be found in this field. References to reports and discussions of criminal statistics 
are easily found in this volume. 

This bibliography was later followed by two supplemental bibliographies 
prepared by Dorothy Culver (now Mrs. Dorothy Tompkins), under the 
auspices of the Bureau of Public Administration of the University of California, 
the first covering materials published during the years 1927-1931 [3], and the 
second 1932-1937 [4]. More recently Mrs. Tompkins has prepared two addi- 
tional bibliographical reports in which the materials are both more selected, 
and more descriptive, but nevertheless furnish a thorough supplementation to 
the earlier described volumes. The first of these, Sources for the Study of the 
Administration of Criminal Justice [21], was prepared for the Special Crime 
Study Commissions and the California State Board of Corrections, and covers 
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materials through 1948. The second titled, Administration of Criminal Justice, 
1949-1956 [22], was also prepared for a Special Study Commission on Correc- 
tional Facilities and Services in California. These latter two can be obtained 
through the California Board of Corrections. 

These five comprehensive bibliographies, the original by Professor Kuhlman 
and the four subsequent issues by Mrs. Tompkins, provide valuable and rather 
complete references to most of the source materials that have been published 
relating to all aspects of crime and the administration of criminal justice in- 
eluding statistical source materials. 

The National Probation and Parole Association has published both bib- 
liographical and other materials, particularly in the field of sentencing, proba- 
tion, and parole. Nearly every survey or authoritative study made in the field 
of delinquency and crime has included a useful bibliography of the general 
subject matter covered. 


2. OFFICIAL OR GOVERNMENTAL REPORTS 


Historically, the earliest statistical materials relating to prisoners compiled 
on a nation-wide basis are found in the United States Census Bureau decennial 
census compilations. Such statistics were shown in each decennial census from 
1850 to 1890. Independent counts of prisoners were made by the Census 
Bureau in 1904, 1910, and 1923. In the census of 1950, a special enumeration 
was made of prisoners in institutions on the day of the census, and this material 
was subsequently published as a part of the 1950 census reports. This pro- 
cedure is again planned for the 1960 census. 

Soon after the 1923 compilation, an annual collection of statistical data on 
prisoners in state and federal prisons and reformatories was undertaken by the 
Bureau of Census. Annual summaries on prisoners were published by the 
bureau for the years 1926 through 1946. Only a brief summary was issued in 
1947. Subsequently, the United States Bureau of Prisons assumed responsi- 
bility for this series. A brief annual summary, entitled National Prisoner 
Statistics, has been issued for each year since 1948, giving the number of prison- 
ers received in the institutions of each state and the Federal Government, and 
the number and type of releases. Annual reports analyzing in detail the mate- 
rials collected were to be issued in the National Prisoner Statistics series, but 
to: date the only reports of this type issued relate to prisoners committed in 
1950 and prisovers released in 1951. It is hoped that there will be reports issued 
covering the succeeding years in the near future. In addition to these sum- 
maries, another annual publication is issued in connection with National 
Prisoner Statistics accounting for legal executions throughout the United States. 

A second series of reports inaugurated early in the thirities by the Census 
Bureau was designed to give nation-wide information in the field of judicial 
criminal statistics. Summary information was requested each year from trial 
courts of general jurisdiction (usually one in each county within each state) 
which would show, by offense, the number of persons prosecuted, the disposi- 
tion made of prosecutions, and the sentences imposed on convicted persons. 
The first summary for this judicial series was published for the year 1932. It 
accounted for information received from 16 states. The collection of data was 
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continued for about 15 years, but for no one year were there more than 32 
states reporting. The last annual publication in this series was for the year 1946. 
The series was then abandoned because of its incompleteness and the tremen- 
dous difficulty in obtaining uniform and comparable data from local counties 
without state responsibility for collecting, checking, and editing the informa- 
tion summarized at the end of each year. 

The International Association of Chiefs of Police was long concerned with the 
need for more information on the amount of crime reported to the police and 
the number of persons arrested. This organization sponsored a study to de- 
velop a plan for the collection of such data, and in 1929 the plan was published 
in a volume called Uniform Crime Reporting. [6] This volume outlined reporting 
forms and procedures that could be followed by each police department in 
order to supply monthly and annual information on crimes reported to the law 
enforcement agencies and on persons arrested. In 1930, the Federal Bureau of 
Investigation was authorized by Congress to undertake operation of such a 
reporting plan among the police departments of the country. This resulted in 
a series which today is published semi-annually and annually under the title, 
Uniform Crime Reports. These reports summarize the number of offenses in 
the seven offense groups identified as Part I Offenses, which had been selected 
for the reason that they represent the types of criminal offenses that would be 
reported in nearly every instance to a law enforcement agency. The information 
on crimes is shown in both annual and semi-annual reports for the individual 
police departments of most of the major and many of the lesser cities of the 
country. In addition, summary data are shown for the United States as a whole, 
relating to persons arrested by offense and age. In December, 1958, a special 
issue of Uniform Crime Reports was published containing the report of a con- 
sultant committee which formulated recommendations for changes in this sys- 
tem of statistics. As a result, in the future only one report of this series will be 
issued on an annual basis. This report will be preceded by quarterly summaries 
of limited information. 

It may be noted that Uniform Crime Reports is a collection of data voluntarily 
supplied by individual police departments throughout the country, and that 
not all departments contribute to this information. However, it does account 
for most of the nation’s major city departments, and an increasing number of 
sheriffs’ offices and rural police agencies are reporting in this series. 

One of the major difficulties always involved in a compilation of data re- 
ceived from a large number of individual sources (in the case of Uniform Crime 
Reports, several thousand police agencies) is that there is a great deal of un- 
evenness and variability in the data despite the fact that uniform instructions 
are issued. This renders it impossible to make geographical comparisons or even 
time comparisons of the data with any great accuracy. Further, the data com- 
piled in these reports are not complete for any single state, and the state totals 
must be limited te the summations of data from cities reporting within the 
States. 

The soundest method for compiling criminal statistics for the country as a 
whole would be to have each state take full responsibility for the collection, 
compilation, and interpretation of the criminal statistics relating to local juris- 
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dictions within its borders. At least the definitions of law, the administrative 
procedure, and the general organization of criminal justice are the same within 
each state. To encourage states to assume this kind of responsibility, a Uni- 
form Criminal Statistics Act has been proposed by the Commissioners on 
Uniform State Laws. Only one state at the present time, California, has adopted 
the Act, and has developed a central bureau of criminal statistics for the general 
purpose of collecting and compiling all information on crime within the state. 
Three annual reports are issued by this bureau entitled Crime in California 
since 1952, Delinquency and Probation in California since 1954, and California 
Prisoners since 1952. 

Several other states have one or more central agency or bureau devoted to 
compiling state-wide criminal statistics. Historically, certain state officials may 
have the responsibility for certain statistical data which are requested from 
county agencies or courts. An annual or biennial report of a secretary of state 
or a state attorney general will frequently contain this kind of information. 
However, except for those states that have established a central responsibility 
these reports have been largely issued on the basis of whatever annual data 
might be reported for local or state agencies, without regard to its accuracy or 
comparability and with very little editing. 

The New York State Department of Corrections issues an annual statistical 
summary of information on crimes, arrests, and court dispositions as well as 
data on the prison system. The Massachusetts Commissioner of Corrections 
issues an annual report covering court statistics, probation statistics, and data 
relating to the prison system. The States of Michigan, Minnesota, New Jersey, 
Ohio, Washington, and Wisconsin have developed central agencies for the col- 
lection and summarization of a great deal of information on criminal law en- 
forcement in their states. Louisiana, Texas, and Pennsylvania have recently 
centralized the responsibility for their correctional statistics, and there are be- 
ginnings of similar developments in this direction in a number of other states. 

It has been pointed out that the basic responsibility for criminal law enforce- 
ment, for prosecution, and to a large entent correctional services rests with 
local agencies in the cities and counties. Many of these individual dual local 
agencies, particularly the large ones, issue annual reports in which statistical 
data relating to crimes and offenders that are handled are published. The dif- 
ficulty involved in making use of this kind of statistical information is that 
methods of summarizing and presenting such data are not uniform, so that 
comparisons of the statistics issued cannot be made from local agency to local 
agency; nor are the time periods that ure covered by such reports always 
comparable. 

In addition to the official or governmental reports mentioned thus far, there 
have been two important national surveys conducted under the auspices of the 
Federal Government; one in the early thirties and one late in the thirties. The 
National Commission on Law Observance and Enforcement chairmaned by 
George W. Wickersham and frequently cited as the Wickersham Commission 
was authorized to carry out a series of studies on various aspects of criminal 
law administration. This resulted in some 12 volumes of reports summarizing 
a wealth of valuable information. Report No. 3 [28] of this series prepared by 
Professor Sam Bass Warner of the Harvard Law School was on criminal 
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statistics and was made up largely of a review of the source materials on crimi- 
nal statistics in the United States available about 1930. Another report, No. 4, 
[29] relating to prosecutions, contained considerable statistical data on the 
general mortality of criminal prosecutions. This is a valuable reference source 
for such information covering the latter part of the decade of the twenties. 

A second national survey which compiled statistical information relating to 
release procedures was undertaken under the auspices of the United States 
Attorney General in 1936. This study described the practices of all the states 
with relation to sentencing, probation, prison, and parole, and resulted in the 
publication of five separate volumes: (I) Digest of Federal and State Laws on 
Release Procedures; [23] (II) Probation; [24] (III) Pardon; [25] (IV) Parole; 
[26] and (V) Prisons. [27] These reports contained some limited statistical 
materials which had been collected from various states in connection with the 
survey. 


3, GENERAL SURVEYS 


A great deal of the impetus for the development of better criminal statistics 
came during the decade of the twenties as a result of crime surveys conducted 
in different parts of the country. The first was the Cleveland Survey [14], made 
during 1919 and 1920, and published in 1922. Two different reports were made 
of the work done on the Cleveland Survey. Comprehensive statistics were com- 
piled, accounting for all aspects of criminal law administration within the city 
of Cleveland at this time. These demonstrated, possibly for the first time, the 


case mortality that occurs in the administration of criminal justice between 
offense and arrest and final disposition and sentence of criminal offenders. 

The Cleveland Survey was followed in 1926 by the Missouri Survey, [10] 
another analysis of statistical data showing the processes of criminal justice 
in the State of Missouri. This was followed by the Illinois Crime Survey [1] in 
1929. Again a graphic portrayal was given of the statistics of crimes and arrests 
in the City of Chicago and comparisons were made with other parts of Illinois 
and with the City of Milwaukee. Late in the twenties, New York issued a 
series of studies made by the Commission on the Administration of Criminal 
Justice [12]. Here again, a great deal of statistical data were compiled relating 
to criminal justice in the State of New York. In 1931, a survey by Morse and 
Beattie [11] was published concerning the administration of criminal justice 
in the State of Oregon and particularly the in City of Portland and Multnomah 
County. 

A most significant development in the field of judicial statistics was the 
series of studies sponsored by the Institute of Law at Johns Hopkins University 
in the late twenties. These were made under the direction of Dr. Leon C. 
Marshall, and explored the reporting possibilities of detailed information on 
individual cases for both criminal and civil courts. Data were collected from 
the states of Ohio, New Jersey, Iowa, Maryland, Rhode Island, and Delaware 
[8]. A series of reports describing this work and the forms used together with 
an analysis of the material collected was published during 1931 and 1932 by 
the Johns Hopkins press. One of these reports in particular, Comparative 
Judicial Criminal Statistics: Ohio and Maryland; A Comparison of Trial Courts 
Statistics for 1930, [9] became the basic pattern for developing uniform plans 


q 
ac} 


588 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1959 


in the field of judicial criminal statistics. When the Bureau of the Census under- 
took its collection of such statistics in the various states in the thirities, the 
forms and procedure used were those developed by Dr. Marshall in the Johns 
Hopkins studies. 

All of these surveys and many others of a less comprehensive nature were 
prime sources of statistical data in the field of the administration of criminal 
justice. It was as a result of these surveys that effcrts were made to develop 
regular reporting of information on criminal law and administration so that 
all persons concerned could each year know exactly what had happened in the 
process of criminal justice in their particular area. 


4, SPECIAL PROBLEMS 


Within the general topic of crime and criminal law administration lie many 
special types of offenses and offenders which have been the center of a great 
deal of study. Many of these special problems which have been studied can 
be found under their topic items in general bibliographies and other periodical 
digests and indexes. Only a few of major interest can be identified here. 

Homicide. Special studies have been made of homicides in several instances. 
Particularly does the topic of the death penalty as it is related to this subject 
became one of considerable bibliographical material including a certain amount 
of statistical data. While all but seven states in the United States still retain 
the death penalty, its use as will be observed from the National Prisoners 
Statistics report on “Executions” is decreasing, so that during the last few 
years, 1955-58, the annual number of executions in the country has been the 
lowest in the history of recorded data. 

Narcotics. In very recent years the problem of the growing use of narcotics 
has been receiving a great deal of public attention. Particularly in the states 
of New York and California, which seem to be areas of import of opium deriva- 
tives, there have been special commission studies made of this problem. In 
nearly all of these studies will be found some statistical compilations. 

Alcohol. The alcohol problem is stated to have a high correlative relationship 
with crime. Apart from ordinary criminal offenses, a large portion of the ac- 
tivity of urban police is devoted to the picking up of persons under the influence 
of liquor—the drunk problem is one that faces every police department. Many 
annual reports of municipal police departments will furnish a great deal of in- 
formation concerning the number of arrests for drunkenness. 

Sex offenders. Special attention has been given in many states to the fre- 
quency of sex offenses and the handling of sex offenders. Special commissions 
and special studies have been made in this regard usually to support proposals 
for special legislation for handling the persons involved in these types of 
offenses. 

Juvenile delinquency. Juvenile delinquency has become a topic of considerable 
concern in the last few years. It is alleged to have grown tremendously in 
volume and to account for a substantial number of the more serious criminal 
offenses. Sources of information regarding juvenile delinquency are rather 
meager. The United States Children’s Bureau has been engaged for some years 
in the collection of data relating to children appearing in juvenile courts. The 
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Bureau has devoted a great deal of effort in the development of this type of 
information, and has established in recent years a standard reporting scheme 
in which certain juvenile courts in each state are participating. The data pub- 
lished indicate the volume of workload of the juvenile courts and give a classi- 
fication of the kind of cases involved. However, the great variations that in- 
evitably occur from state to state and even within states from place to place 
as to what is considered juvenile delinquency and what is encompassed under 
the jurisdiction of the juvenile court, make it almost impossible to use these 
data as representing comparability from state to state. The problems of meas- 
uring juvenile delinquency are becoming even more complex and difficult with 
the extention of a great deal of responsibility for juvenile control to police 
agencies throughout the country. Certainly, many more juveniles are con- 
tacted and handled by law enforcement agencies than ever before in the history 
of the country. Yet, these higher counts of juvenile contacts cannot be assumed 
to measure or be an indication of a fundamental increase in juvenile delin- 
quency. A great part of this increase is attributable to greater coverage and 
reporting of incidents that previously were, for the most part, ignored by law 
enforcement agencies. 

Parole. Since parole is the usual form of release from custody, particularly 
from prison and reformatories, much attention has been given to the problem 
of success or failure on parole. There have been at least two national con- 
ferences on the subject of parole called by the United States Attorney General 
in the last two decades. One of these was held in 1941, and one in 1956. Reports 
of these conferences point out the need for developing more accurate parole 
statistics to better know the results of release from institutional treatment. 
There have been several parole prediction studies in the late twenties and 
early thirties (Tibbitts [20], Burgess [2], and Vold [30]) and more recently 
Lloyd Ohlin [13] has issued a manual for parole prediction. Several state parole 
agencies issue annual reports which account for the number of persons placed 
on parole and the number of persons declared violators and returned to prison. 


5. INDIVIDUAL AUTHORITIES 


One of the leaders in the development of criminal statistics within this 
country and also in international efforts has been Dr. Thorsten Sellin of the 
University of Pennsylvania. He has written numerous articles discussing meth- 
ods of gathering and analyzing criminal statistics, and from time to time has 
interpreted many of the data that have been published. Under the auspices of 
the Social Science Research Council he prepared two reports, Research Memo- 
randum on Crime in the Depression [16], and Culture Conflict and Crime [17], 
and under the auspices of the American Law Institute a report entitled, 
Criminality of Youth [18]. These are among the most outstanding contributions 
to the literature of this field and to the interpretation of existing statistical 
data. Dr. Sellin has made many contributions in the field of correctional sta- 
tistics, and also drafted the Uniform Criminal Statistics Act promulgated by 
the Commissioners on Uniform State Laws. He has also made several special 
studies with relation to capital punishment, and the reports he prepared for 
the Royal Commission on Capital Punishment and the Joint Committee of the 
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Senate and the House of Commons of the Canadian Parliament on Capital and 
Corporal Punishment and Lotteries are of extraordinary interest for their 
statistical content [19]. 

The cnly history of criminal statistics in the United States which has been 
published was that of Louis Newton Robinson, History and Organization of 
Criminal Statistics in the United States [15]. Professor Robinson also contrib- 
uted several articles during the seven years following the publication of his 
history discussing plans for improving and reorganizing criminal statistics in 
this country. 

It would be beyond the scope of this brief review to attempt to mention all 
of the well-known scholars and writers in the field who have made special 
studies of value incorporating basic statistical material; however, a few of these 
sources should be mentioned as leads to those who would seek to find basic 
statistical information. Dr. Sheldon Glueck and his wife, Eleanor, have issued 
through the years a series of reports on their work in connection with the 
Harvard Law School, analyzing the outcome of correctional treatment by 
following through the careers of different sets of persons who have been re- 
leased from various types of correctional institutions. 500 Criminal Careers [5] 
was the first of this series of studies. It has been followed by several publications 
along similar lines including studies of women reformatory inmates and juvenile 
delinquents. 

There are many other authoritative contributors who have produced valu- 
able source materials on crime and corrections; Paul Tappan, Donald Cressey, 
Donald Clemmer, Peter Lejins, Joseph Lohman, Austin MacCormick, Lloyd 
Ohlin, Walter Reckless, and George Vold, to mention only a few. 

Among the journals published regularly that specialize in the field of crime 
and correction and from time to time have brief studies yielding valuable 
statistical information are: The Journal of Criminal Law and Criminology and 
Police Science, The National Probation and Parole Association Journal, and for 
more limited and general information, Federal Probation, and the American 
Journal of Correction. Very often valuable materials are published in the 
sociological journals of the country and specialized studies appear in journals 
of psychology and psychiatry. While the many law journals, published usually 
under the auspices of the major law schools of the country, deal largely with 
legalistic materials, nevertheless, from time to time valuable studies in the field 
of criminal administration or juvenile delinquency are published in them. The 
Annals of the American Academy of Political and Social Science have devoted a 
number of excellent issues to the crime and corrections problem. 


6. CONCLUSION 


It would appear from this limited survey of sources of statistics on crime and 
correction that there is no really satisfactory source of information which can 
be used to show the amount of crime, the number of offenders, or the particular 
handling of them for the United States as a whole. A fairly comprehensive 
amount of information on the crime situation is available each year in a few 
states and a number of localities, but the lack of comparability and the in- 
completeness of the data make it of limited use in reaching general conclusions 
regarding crime throughout the country. 
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The data relating to the amount of crime and the number of arrests can come 
only from the records of law enforcement agencies and because of the thousands 
of individual agencies performing these functions, this statistical area is the 
weakest and least comparable of all. While the Uniform Crime Reports based 
on data collected from most of the major police departments of the country is 
compiled on the basis of uniform definitions and instructions, nevertheless the 
fact that several thousand independent agencies furnish these reports with 
variant practices among themselves as to the handling of crime reports and as 
to the booking of persons arrested, plus the fact that all this eccurs in 50 dif- 
ferent criminal law jurisdictions, makes it utterly impossible to assure any real 
comparability in the data so reported and compiled. Even the national prison 
collection, which is probably the most comparable from state to state, and the 
most accurately reported (there are only approximately 170 institutions in the 
country classified as prisons or reformatories for adults) is subject to some of 
the same limitations of separate state jurisdictions differing materially in defini- 
tions of crime and in the selection of offenses which are punishable by 
imprisonment. 

Until the time comes when offenses reported to the police can be analyzed 
uniformly from the original crime reports and classified into more meaningful 
subdivisions of the major criminal offense types, there will be little chance 
that the information on crimes, or even on persons arrested and booked, will 
have sufficient consistency to furnish a reliably comparable picture of crime in 
the various states and localities of this country. Progress is being made in this 
direction, primarily from the fact that police departments and sheriffs’ offices 
are becoming more and more professionalized and concerned with accurate ac- 
counting of the work that they do. If, as it is hoped, the next ten years show a 
growth in the acceptance of the states for the responsibility of the development 
of their own criminal statistics, and at the same time provision can be made 
to bring together in uniform terms the data collected in each of these states, 
there will then exist a much more satisfactory source of statistical information 
on the crime and correction problems of the United States as a whole. 
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PUBLICATION DECISIONS AND TESTS OF 
SIGNIFICANCE—A COMMENT 


Gorvon TuLLock 
University of Virginia 
The custom of only publishing research when it reaches a certain 


degree of significance is likely to lead to errors, not through repetition 
of the same experiments, but over many different experiments. 


HE purpose of this comment is not to criticize Sterling’s important article,' 

but to suggest that it has somewhat wider application than he implies. The 
specific type of error to which Dr. Sterling refers (p. 33) is the repetition of some 
given experiment by different investigators until one of them obtains a signifi- 
cant result, which is then published. It do not think this is at all a frequent 
occurrence. If an hypothesis had been tested by a number of independent in- 
vestigators, all but one of whom had obtained negative results, the publication 
of the positive results of the single investigator who had had the good fortune 
to obtain a “significant” result would normally bring the others out in the open. 
A man who has devoted a good deal of time to testing a hypothesis with a given 
experimental design and has been forced to conclude that the hypothesis is 
incorrect will normally simply file his results. If, however, he sees an article 
reporting a similar experiment with significant results, he will at least write 
the editor of the journal in which it appears. Since this type of comment on 
articles is rare, I judge that the duplication of experimental designs is rare. 

But while this indicates that investigators have minds sufficiently independ- 
ent that they seldom duplicate others’ experiments by accident, it does not 
reduce the importance of Sterling’s basic point. The necessary restrictions 
imposed on articles by the scarcity of publication resources does result in greatly 
reducing the confidence we can put in statistical tests of significance as reported 
in the literature. There is little difference, statistically speaking between the 
likelihood of a false correlation from 20 investigators all investigating inde- 
pendently the same false hypothesis with the same experimental design, or 
from the same 20 investigators investigating 20 false hypotheses. The same 
number of results of a given level of significance would be expected from either 
“design.” 

The moral of these considerations would appear to be clear. The tradition of 
independent repetition of experiments should be transferred from physics and 
chemistry to the areas where it is now a rarity.? It should be realized that re- 
peating an experiment, while not necessarily showing great originality of mind, 
is nevertheless an important function. Journals should make space for brief 
reports of such repetitions, and foundations should undertake their support. 
Academics in the social sciences should learn to feel no more embarrassment in 
repeating someone else’s experiment than their colleagues in the physics and 
chemistry departments do now. 


i “Publication Decisions and Their Possible Effects on Inferences Drawn from Tests of Significance—or Vice 
Versa,” Theodore D. Sterling, Journal of the American Statistical Association, March, 1959, p. 30. 

2 A good illustrative example of the practice in the physical sciences is A. B. Mah's “Heats of Formation of 
Alumina, Molybdenum Dioxide, and Molybdenum Trioxide,” Journal of Physical Chemistry, 61 (1957), p. 1572. 
Dr. Mah repeated a series of careful measurements made by several other scientists. Since his work simply confirmed 
theirs it was reported in the form of a brief note rather than in a full article. 
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SOME FINITE POPULATION UNBIASED RATIO 
AND REGRESSION ESTIMATORS* 


M. R. Mickey 
General Analysis Corporation 


A class of ratio and regression type estimators is given such that the 
estimators are unbiased for random sampling, without replacement, 
from a finite population. Non-negative, unbiased estimators of es- 
timator variance are provided for a subclass. Similar results are given 
for the case of generalized procedures of sampling without replacement. 
Efficiency is compared with comparable sample selection and estimation 
methods for this case. 


1. INTRODUCTION 


wo of the well recognized ways of bringing related information to bear on 
the problem of estimating the values of a population characteristic are of 
present interest. The first is the selection of an estimation procedure which in- 
corporates into the estimator values of related variables whose population 
characteristics are known. The second is the selection of a sampling plan in 
which the probabilities of observing the population elements are dependent 
upon the values of the related characteristics. 
Perhaps the simplest estimators that take account of the related information 
are the ratio estimators 


and = uf (1.1) 


where #, #, and 7 are the sample means of y;, z;, and r;= y;/x; and p, is the popu- 
lation mean of the related characteristic x. It is well known that these are biased 
estimators of yu, and that the bias is an important disadvantage in some appli- 
cations. Hartley and Ross [4] gave an elegant expression for the biases and 
developed an unbiased version of 9 as 


y’ = + —— (jf — 72). (1.2) 


Robson [8] derived this estimator, and others, from a different point of 
view and gave an unbiased estimator of the variance of y’. Goodman and 
Hartley [3] developed less complex estimators of the variance of y’ and gave 
an extensive discussion of the relative efficiencies of tne estimators 7, 9, and y’, 
and some regression type estimators. Olkin [7] considered a multivariate ver- 
sion of y and y’ and developed an expression for the variances and estimators 
of the variances for each. 

The present paper presents a broad class of unbiased ratio and regression 
type estimators. This extension of the set of available unbiased estimators may 
be of practical interest, particularly where regression type estimators are ap- 
propriate, as well as of theoretical interest. The class of estimators is defined in 


* Most of this research was done while the author was associated with Iowa State College. It was completed 
while he was associated with the RAND Corporation. 
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section 2 and examples are given in section 3. The examples show that y’ be- 
longs to the class. The variance of estimators is not investigated, so that ques- 
tions of relative efficiency are not treated. However, the important problem of 
estimation of estimator variance is considered in section 4 and it is shown that 
simple unbiased non-negative estimates of estimator variance can be readily 
computed for a sub class. Numerical examples are given in section 5. 

The second way of utilizing related information, that of incorporating it into 
the sampling procedure as a basis for determining the probabilities with which 
various population elements come into the sample, has been discussed by 
Horvitz and Thompson [5], and others. The idea underlying construction of 
unbiased estimators for the random sampling case is extended to the case of 
the more general sampling methods in section 6. Some new unbiased estimators 
are obtained, along with unbiased estimators of the variance of estimator. The 
efficiency comparison of these estimators with those considered by Horvitz and 
Thompson indicates approximate equal efficiency. 


2. THEORETICAL DEVELOPMENT 


Let the finite population of N elements be represented by the set of (p+1) 
component vectors (4/3, tps), B=1, 2,---, N, and define the 
following population parameters as 


Tis, 


The problem of present concern is the estimation of u, from a random sample 
(without replacement) of n elements, given that N and the y,, are known. 

The estimators to be presented stem from the evident fact that for any choice 
of constants a), - - - , a, the estimator 


i=1 


is an unbiased estimator of u,, where 7 and #; denote sample means. This fact 
is exploited in connection with a second observation: for any choice of a of the 
sample elements, a<n, the remaining n—a elements constitute a random sam- 
ple of size n—a from the finite population of N—a elements derived by ex- 
clusion of the selected a elements from the given population. Consequently, 
the a; maybe chosen as functions of the a selected elements and estimator 
(2.1) may then be applied for the sample of size n—a to obtain an unbiased 
estimator for the derived population. This in turn leads to an unbiased estima- 
tor for the original population, since the relation between the population char- 
acteristics for the two populations is determined by a, N, and the a selected 
elements. This is the idea developed here. 

For the purposes of formal development, suppose that the sample elements 
are drawn one at a time, so that the sample is an ordered set, the order being 
that of the order of draw. Let Z, denote the ordered set of observations on the 
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first « sample elements, 1Sa<n, and let a;(Z,) denote functions of these ob- 
servations. Let Y(a), X;(a) denote the sums and g(a), %(a), i=1,---, p, 
denote means of the indicated observations on the first a sample elements. 
Finally, let E{|Z.} denote conditional expectation given Zz. 

In terms of the above notation, an estimator ¢, is defined by the following 
equivalent expressions: 


tml 


= + [ vem) | 


- 
N(n — a) 
N 


The form of t, given by (2.2) is instructive in that the first term is a biased 
estimator of u, and the second term is an estimate of the bias. It is interesting 
that the estimate of bias can be viewed as being obtained by applying the form 
of the biased estimator to the subsample in estimating the sample mean 9; the 
estimate of bias of the first term is a multiple of the resulting estimate of bias 
of the estimator when applied to the subsample as estimating the sample 
mean. 
The interest in ¢. is formally established as: 


Theorem. The estimator ¢, is an unbiased estimator of yy. 
Proof, Expression (2.1) is applied by observing that 


ni — aij(a) NE, — 


are the sample means of the last n—a sample elements, and that 


Nu, — Nuz,; — 
N-e@ 


i=1,---,p 


are the means of the derived population obtained by deleting the first a sample 
elements. Let u, denote the statistic corresponding to (2.1), 


(2.2) 
(N — a)n 
N(n — 
(2.3) 
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so that 
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Zab = 


Since 


(N — a)a + af(a) 
N 


ta = 


it follows that 
E{t.| Za} = by 2.6) 
and hence that the unconditional expectation has the same value, 
E(ta) = py. (2.7) 


Thus ¢, is an unbiased estimator of py. 

A general class of unbiased estimators may now be constructed by including 
all estimators of the form of ¢, applied to any permutation of the ordering of the 
sample elements and weighted averages of such estimators. Also included in 
the class are estimators obtained in this way from subsamples of the given 
sample. The above theorem may be applied to show that all estimators of the 
class are unbiased. Of particular interest is the estimator ¢,* obtained from ¢, by 
averaging over all permutations in the ordering of the sample elements. It may 
be noted that this is analogous to the application of Blackwell’s Theorem [1] 
where the unordered sample plays the role of the sufficient statistic. Corre- 
spondingly, it follows that the variance of ¢,* is not greater than the variance 
of te. 

The term “ratio and regression type estimators” has been used in a more or 
less conventional sense. The term is not well defined. In a special sense, an 
estimator may be termed of a regression type if it is invariant with respect to 
location and scale changes in the z-variables, and undergoes the same location 
and scale change as the y-variable. A ratio type estimator may be characterized 
similarly by limiting consideration to scale changes only. In this sense, the 
estimator y’ considered by Hartley and Ross, equation (1.2), is a ratio type 
estimator. Whether or not an estimator of the class defined above is of the ratio 
or regression type in this sense will depend upon the properties of the coefficient 
function a;(Zq). 


3. EXAMPLES 


Particular estimators result from the specification of the coefficient functions 
a;(Z_). This choice will ordinarily be made along the lines indicated by linear 
regression theory. For example, for the case p=1, the ratio estimates result 
from minimization of 
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w(z;) E - * (3.1) 


=z 


with respect to variation in y,, where yu, is known and w(z) is, loosely speaking, 
inversely proportional to the conditional variance of y given x. Direct minimiza- 
tion leads to the biased ratio estimators 


This leads to the choice of a(Z,) as 


(3.2) 


Dd 


a(Z.) = (3.3) 
2;*w(zx;) 


j=l 


The resulting unbiased estimators are presented for the cases: (a). w(x) =1/z?, 
and (b). w(x) =1/z. 


a. = (3.4) 


By using (2.4) there results 


i = 


= 


in which 


jul 
Note that 4,*=y’, the estimator considered by Hartley and Ross. It can be 


shown that t.*=y' for a=2,---,n—1 as well as for a=1, so that no new 
estimators are obtained by this averaging process for different values of a. 


(3.7) 


Expression (2.4) is again convenient and yields the estimators 


(N —a)n 
= Raps + | 


(N — a)n 
Kotte + N(n — ly 


~ i 
| 
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These are ratio type estimators similar to the customary 


y 


Computationally, the cases a=1 and a=n—1 are the most convenient. The 
case a=1 reduces to (3.5), (3.6). For a=n—1, R,_; has the form 
nj - Yi 


R,-1 = — )» — (3.10) 
jai NI — I; 


so that ¢*,,_, is fairly simple to compute. 
An example of a regression type estimator is provided by using the familiar 
form for the linear regression coefficient 


y;|2; — &(a)| 


a(Z.) = = be. 


— #a)]? 


j=l 


Equation (2.2) yields the form 


a(N — 


(n — a)N 


It is evident from (3.11) and (3.12) that this estimator is of the regression type 
in the sense specified in section 2. That is, if y;’=a+by;, 2;’=ce+dz;, d+0, 
then t,’=a+btq. 

The estimator derived from (3.12) by averaging over all permutations is more 
readily computed for a=n—1 than for any other value of a. Let b,’ be the value 
of the regression coefficient if the j-th sample element is omitted, i.e. 


b,’ = 


n 
— #)? (x; — 
i=! n—1 


Then becomes 


_N 


| na | (3. 14) 


j=l 


Nn 
It is evident that unbiased versions of a wide variety of estimators can be ob- 
tained along the lines of these examples. The extension to the case p>1 is 
straightforward; presumably the case of greatest interest is that given by the 
usual least squares regression coefficients. The case of general weighted regres- 


sion does not appear to be accommodated within the class of estimators under 
discussion, but requires an extension. 
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4. ESTIMATION OF ESTIMATOR VARIANCE 


The problem of constructing unbiased, non-negative estimators of the vari- 
ance for any estimator of the class described in Section 2 appears formidable. 
It is interesting, however, that for a subclass of these estimators, the prob’ern 
is readily solved. The discussion of this section is confined to this case. 

It is convenient to introduce a slight notational change by explicitly display- 
ing the size of the sample upon which the estimate is based; ¢t(a, n) denotes the 
estimator ¢, based upon a sample of size n. In these terms, the type of estimator 
under consideration may be described as follows. Let there be given +1 
integers 0<ai;<ae< <ax4:=n, and consider the k estimators t(a:, as), 
t(a2, a), t(ax, n). The estimator t(a;, is based upon the first aj; 
sample elements. The estimator of present interest is the average of these, 


= — aj41). (4.1) 


j=l 


Since each ¢(a;, a;41) is an unbiased estimator of p,, so is 7. 
An unbiased, non-negative estimator of the variance of / is given simply as 


k 


1 
= kk-D [t(aj, — (4.2) 


This follows from the consideration that the t(a;, aj4:), j=1,---, k, are 
mutually uncorrelated, as may be demonstrated by the use of equation (2.6): 
Let E.{ } denote the conditional expectation E{ |Z.}. Then, for i<j, 
E{t(a, } E{ EBa,,,{t(ai, aj41) } } 
= El Ba, } } 
E{t(ai, ai+1) My} 


= py’. 


The unbiased nature of s*(#) is readily shown by using this property. 
Note that there is a great deal of flexibility in this construction. For example, 

for sample size 3, a1=1, a2=2, a;=3, one may take ¢(1, 2) as the ratio type 

estimator (3.5), and ¢(2, 3) as the regression estimator (3.12). In any event 


4[t(1, 2) — ¢(2, 3)]? 


is an unbiased estimator of the variance of the estimator 
4[#(1, 2) + ¢(2, 3)]. 


The precision of estimators of type (4.1) can be improved by averaging with 
respect to a random sample, or all, of the possible orderings of the sample ele- 
ments. For the given set of sample elements let #, and s,? denote the values of 
(4.1) and (4.2) for the order represented by y. Let 71, - ++, Ym be a random 
sample (without replacement) of the n! possible orderings, and consider the 
estimators 


v 
| 
ihe 
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(4.3) 


3°(T'm) = — 8,2 — — ~ (i,, — Tn)?. (4.4) 


Then T,,, is an unbiased estimator of yu, and s?(7’,,,) is an unbiased estimator of 
the variance of 7’,,. The unbiasedness of (4.4) follows from the considerations 


1 1 a 
> (ty, My)? (ty, — T,)*? + (T. — (4.5) 
m 


— Gy — = vari) = — Eh, + var(Ta), (4.6) 
m 


and the unbiased character of s,,”. 

It can be seen from (4.6) that most of the gain in precision can be achieved 
with relatively small values of m. Let E{ |S} denote conditional expectation 
with respect to the unordered set of sample elements S, and let 7* denote T,,, 
the estimator obtained by averaging over all permutations. 

It follows from standard formulae that 

(m — 


and this, combined with (4.6) leads to the result 


(m — 1)n! 
var(7T,,) = var(7T*) 
m(n! — 1) 
that is, that var (7) is a weighted average of var (7*) and var (#). Equation 
(4.8) can be written equivalently as 


E{(@i—T7*)?|S}, (4.7) 


(m — 1)n! 


var(t), (4.8) 


1 ni —m 
var(T'n) = var(7*) + — [var(#) — var(T*)]———- (4.9) 
m n! — 1 
and in this form shows that most of the variance reduction can be attained with 
small values of m. 


5. NUMERICAL EXAMPLE 


The calculations for the estimating procedure specified by (4.1) and (4.2) 
can be compactly arranged. An illustration of this is provided by applying the 
ratio type estimators (3.8) to an example constructed by Cochran [|2, Table 
6.1]. In this example the population is a selection of 196 United States cities. 
The estimation problem is that of estimating the total number of inhabitants 
in these cities in 1930 on the basis of the 1930 census count for a random sample 
of these cities and 1920 census information for all of the cities. The variable y is 
the number of inhabitants of a city in 1930 and z is the corresponding number 
for 1920. The unit of count in this example is 1,000 individuals. Cochran gives 
values of x and y for a random sample of 49 of the 196 cities. 


q ] 
m 
1 
™ 
“a 
; 
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Example 1. The population, for this example, is the set of 49 elements given 
by Cochran. A random sample of five elements was selected; the elements 
(y:, 2;) in the order drawn were: (63, 37), (58, 50), (80, 76), (53, 45), and (113, 
121). The estimator illustrated is that given by (4.1) and (3.8) with a;=7: 
N-i+l1 


— 1) + [Y@) 6.) 


— 1, 2) 


i Y 


Yiy 


j=l 


Y(?) 


(5.2) 


TABLE 602. ILLUSTRATION OF COMPUTATIONS FOR ESTIMATOR 
SPECIFIED BY (4.1) AND (5.1) 


uz = 103.1429 py = 127.7959 


Y() N-i+1 


i Y(i) X(i) | 
- N 


2 121 87 1.7027 — 27.135 0.9796 149.04 
3 201 163 1.3908 — 25.700 0.9592 118.80 
4 254 208 1.2331 — 2.485 0.9388 124.85 
5 367 329 1.2212 —34.775 0.9184 94.02 


The estimators / and s*(é) are calculated from the entries in the last column of 
Table 602 as: 


i= 149.04 + 118.80 + 124.85 + 94.02 
4 
_ (149.04)? + + (94.02)? — 4(121.6775)? 
(4)(3) 
Example 2. For large samples, the computational work may be reduced by 
choosing a relatively small value for k. The procedure is illustrated for Coch- 


ran’s example for the choice k=4. 
Equation (3.8) takes the form 


(N — ai-1) 
N(a 


= 121.7 


= 127.7 


s*(2) 


as) = + [Y(as) — (5.3) 


where 


ai ai i) 
Y(a) = X(a) = Daj, and R(a) = 

j=l j=l xX (a) 
The calculations are shown in Table 603. As before, the desired estimates are 
calculated from the last columa of the table on the basis of equations (4.1) and 
(4.2). The values obtained are 


(5.4) 


= 
where 
— 
| 
t(i—1, 2) 
| | 
| 
“i 
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i = 145.90, s(i) = 0.71. 


The value of y, for the example, [2] is 149.75, in units of 1,000 individuals. 

A relatively small value of k was chosen for this example to provide a com- 
pact illustration of a computational routine. It should be kept in mind that 
since s*(#) has k—1 degrees of freedom, the precision of the estimate of estima- 
tor variance is directly related to the size of k. In this example s*(é) has 3 de- 
grees of freedom. The large absolute value of the Student ¢ratio, 5.42, although 
less than the tabulated one per cent point, indicates that 0.71 is an underesti- 
mate of the standard deviation of ?. 


TABLE 603. ILLUSTRATION OF COMPUTATION FOR ESTIMATE OF 
1930 POPULATION OF 196 UNITED STATES CITIES. DATA FROM 
COCHRAN [2, TABLE 6.1] 
= 116.933673 


X (a;) 


N (ai —aj-1) 


1.163531 168 .574818 .096606 
1.230373 10.103736 .075255 
1.233373 23 .095862 .084184 
1.238737 1.423302 .098852 


6. EXTENSION TO OTHER SAMPLE SELECTION METHODS 


The preceding discussion assumed that the sample selection method used is 
that of random sampling without replacement. The underlying idea can also 
be exploited in the case of more general sample selection methods. These have 
been discussed by Horvitz and Thomson [5], and others, as alternative, or 
additiona! means of bringing related information to bear on the estimation 
problem. Jessen [6] has given an interesting application for the estimation 
of total fruit count of a tree. As is demonstrated by this example, such pro- 
cedures may introduce simplification in sample selection as well as utilize re- 
lated information. 

This section further develops the basic idea of this paper by supplying addi- 
tional unbiased estimators, along with non-negative, unbiased estimators of 
estimator variance, for the more general sample selection procedures. These 
procedures usually exploit related information by means of selecting elements 
with probabilities proportional to the size of a variable correlated with the 
variable of interest. Another possibility is that of choosing selection probabili- 
ties on the basis of “conditional variance,” say, and utilizing the related varia- 
bles in the estimator, along the lines previously considered. This possibility 
stems from the consideration that the following development replaces the ob- 
served sample sequence , With a sequence - - - , wu, such that the 
u; are uncorrelated estimators of the population characteristic; actually the 
characteristic property is given in terms of «onditional expectations: 


2 603 

| | 

5 | 804 | 691 | = | - 
ey 2| 19 3103 2522 174.790 
31 4154 3368 144.632 
4} 41 5334 4306 146 .167 
ae 5| 49 | 6262 | 5054 144.991 
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If the sample of related variable, x, - - - , 2, is replaced similarly by 1, - - - , vp, 
then the correlation between y and z can be expected to persist as a correlation 
between u and v. These possibilities are not further developed here. The follow- 
ing discussion pertains, formally, to the case of a single variable, y. 

Let the elements of the finite population be indexed by 8, 8=1, - - - , N, and 
let y(8) be the value of interest associated with the 8-th population element. 
The problem of interest is estimation of the population total. 


N 
= (6). (6.1) 


Let p(6:) be the probability that 6; is the first population element drawn, and 
let p(B;| , 8:1) be the conditional probability that 8, is the i-th element 
drawn given that §:,---, i+ specify the first i—1 sample elements, 
i=2,---,n. It is assumed that p(B:| Bi, -++, Bi1)>0 for B;¥8;, 
j=1,--+-,%t-—1. Finally define the statistics u;,i=1, - - - , n as follows: 


p(B1) 
y (Bi) 


(8;) + 


It is well known, and readily apparent, that ~ is an unbiased estimator of 7,. 
By the same reasoning, it follows that 


E.s{us} T's, i= 2, (6.3) 


(6.2) 


that is, that the conditional expectation of u,, given the first i—1 sample ele- 
ments is equal to 7T,. Therefore, the u; are mutually uncorrelated unbiased 
estimators of Ty. Consequently 


1 n 
i= (6.4) 


N 


= — (6.5) 


n(n — 1) 


are unbiased estimators of T, and the variance of a, respectively. 

The equal weighting of the w; gives rise to a loss of efficiency. Consideration 
of the special case of random sampling suggests that this loss will ordinarily be 
trivially small. The ratio of the variance of a to the variance of N¥ for this case 
is given as 

var(i) — r? 


1 <1 + —_ 
var(N9) 3N(N — n) 3(1 — r) 


== 
e 
an 
; 
: 
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where r=n/N is the sampling rate. It is of passing interest to note the observa- 
tion, made by Jessen, that @ is not a consistent estimator of Ty. 

By an argument similar to that used in section 4, it follows that the precision 
can be improved by computing the conditional expectation of a with respect to 
the unordered sample, and that an unbiased estimator of the variance can be 
provided. It is to be noted that if this computation is replaced by a sample of 
the permutations, the permutation chosen must be selected with ¢_ >ropriate 
probabilities; in general, the permutations are not equally likely. 

A case of particular interest is that for which the probability structure is the 
following: 


p(B:) 
» Bi ¥B;, =1,---,t-1 
p(B; | Bi, =} 1—p(Bi)— - - - —p(Bi-1) (6.7) 


otherwise. 


To simplify the notation, let p; and y; denote p(@;) and y(8;). Equations (6.2) 
then become: 


If (6.8) is written i 


UW 


Di 


it is seen directly that a has zero variance if p(8) is proportional to y(8), 
B=1,---,N. 

Equations (6.8) together with (6.4) and (6.5) provide a convenient computa- 
tional procedure for calculating the estimates @ and s?(a). For the special case 
n= 2, the expressions simplify to: 


1 
ne ) (6.10) 
2\p Pe 


(6.11) 


For averaging with respect to the order of draw, the weights are 


1 — pe 


(6.12) 
2— pi — 


where the subscripts of w indicate the sequence. The resulting estimator of 7’, is 


if 
the 
Yi 
Pr 
i-1 i-1 Vi 
i 
= i=2,---,n. (6.8) 
j=l j=l Pi 
BS: n the form 
Pr 
2— — Ps 
é 
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An unbiased estimator of the variance of u* is given as 
s*(u*) = E{s*@a)| S} — E{(a u*)*| S}, 


when S denotes the unordered set of sample elements. The terms in this ex- 
pression may be evaluated as 


1 


+ 2)? 2 2 
S} = (Ps Ps ( - (6.15) 
4(2 — — P»)* Pz 

so that s?(u*) can be expressed as 


st(u*) = {! -| (= (6.16) 
4 


As an example of the use of these estimators, consider the illustrative exam- 
ple constructed by Jessen [6]. The problem dealt with the estimation of the 
total fruit count of a tree on the basis of a sample of terminal branch fruit 
counts. The terminal branches were selected by a sequence of choices of con- 
tinuing branches at the forks of the tree. Thus the observational data consists 
of the fruit count of a terminal branch and also the probability with which that 
particular branch was selected, for the sample of terminal branches. The illus- 
trative tree has eight terminal branches and a total fruit count of 64. The 
illustrative sample of two terminal branches gives as data: 


ES (a@ — u*)? 


1 
= 5, n=— —=4 
Ss 
2 ’ 16 Ds 


Application of formulae (6.13) and (6.16) gives the results 
1 1 


28 
u* = 68 — — = 67.03 
29 


1 
u* = > (40 + 96) + (40 — 96) 


st(u*) 1— 


(46 — 96)? 


st(u*) = 638. 


| 

2 
16 8 16 
8 16 


RATIO AND REGRESSION ESTIMATORS 


The estimator considered by Jessen was 


1 
2\p Pe 


The bias of this estimator can be expressed as 


as can be seen from (6.13), where 


y p(B) Fe | 
= ——_| —- 7, 6.19 
a1 1 — p(8)L p(s) 


Since p(8) will ordinarily be small, the bias will at most be but a small fraction 
of the standard deviation of 7’. 

Horvitz and Thompson compared the relative efficiencies of several sampling- 
estimation procedures in [5] with respect to an illustration sampling problem. 
The data consisted of number of households and eye estimated number of 
households for each twenty city blocks. The total number of households was to 
be estimated from a sample of two of the blocks. It may be of interest to add 
the estimators under present discussion to this comparison. The variances com- 
pared [5] range from 16,219 for the case of unrestricted random sampling with 
a multiple of the sample mean as the estimator, to 3,075 for a sampling scheme 
and estimation,method presented by Horvitz and Thompson. The variance of 
tu, (6.10) is 3,055 for the case of selection probabilities p(8) proportional to the 
eye estimated number of households. Although the variance of.u*, (6.13), was 
not calculated, it is bounded by the interval 3026<var (u*)<3038. These 
values were calculated by application of Equations (6.34) and (6.37), which are 
developed below. 

The approximate equality of efficiency of the estimators @ and u* with those 
of the sampling-estimation procedures of Horvitz and Thompson is more than 
an accident of the data of the example. The difference of the variances can be 
accounted for by the difference in the “finite population correction factor” that 
arises from lack of “replacement” in the sampling. For samples of size one, the 
two are identical. For samples of size greater than one, Horvitz and Thompson 
consider the estimator 


*) (6.20) 
Pn 


for sampling systems such that 
P(8) = np(8) (6.21) 


where P(8) is the probability that the 6-th element is in the sample. This in- 
sures that 7',, (6.20), is an unbiased estimator of Ty. A slight modification of 
the equation for the variance of T given [5] is 


607 

1 y 
E{T} —T, = — —cov(—, ——}, (6.18) 
2 p 

: 

pee 

n 
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where Z(8) = y(8)/p(8) — T,, and n(n—1)p(a, 8) = P(a, 8) is the probability that 
both the a-th and the 6-th elements are in the sample, a8. If p(a, 8) is in- 
terpreted as a bivariate probability frequency function, it is characteristic of 
sampling without replacement that 


p(s, 8) B=1,--:-,N. (6.23) 
Note also that p(a, 8) is symmetric and that 


p(a, B) = p(s). 


a=l 


Equation (6.22) may be expressed in the form 
var(T,) = — [1 — A(n — 1)} (6.25) 
n 


where o?=var (7) and is’a weighted average of - - , Av-1, the negatives 
of the N —1 characteristic roots that differ from unity of the matrix 


| 
| B=1,---,N. (6.26) 
| V p(a) p(B) 
Thus the factor [1—A(n—1) | in (6.25) appears as the “correction factor”. It 
follows from (6.23) that the sum of all N characteristic roots of (6.26) is zero. 
Since the neglected root is unity, the arithmetic average X of the d;,7=1, -- -, 
N-—1,is 1/N—1. Thus 


1 —X(n — 1) = 
X(n — 1) pear 


(6.27) 
which is the usual form for random sampling. This indicates that (6.27) will be 
appropriate for many purposes. For the Horvitz-Thompson example, the value 
of o? is 6492. The use of (6.27) as the correction factor in (6.25) gives the value 
var (7;)~3075, which is in substantial agreement with the reported values. 

The weightings of the A; in determining the value of A in (6.25) depend upon 
the population {Z(8) } being sampled. For each \;, - - - , N—1, there isa 
corresponding set of values {Z(8)}, so that all values of \ in the interval de- 
termined by the \,; are possible. The appropriateness of (6.27) thus depends 
upon the variation of the \;. For example, for n=2 and p(a, 8) appropriate to 
stratified random sampling, N —2 of the \; are zero and the remaining value is 
unity. It is also possible to construct p(a@, 8) for which a negative value of X; is 
possible. Consequently it is desirable to consider the distribution of the A;. A 
partial resolution of this problem can be made if the sampling system has the 
following characteristic: for any pair 82, if p(8:)=p(G2), then 
= p(Bs, a), for all If the sampling system has this characteristic, and 
if for some pair p(6,) = p(S2), then 


‘ 


(6.24) 
4 
a5 
ee 
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Be) 
p(B1) p(B2) 


is the negative of a characteristic root of (6.26). This result suggests that if the 
y(8) are ordered such that p(1)<p(2)< --- <p(n), then 


_ 
+ 1) 


are acceptable approximations to the \,. For the particular case of scheme I [5], 
for which 


(6.28) 


1 1 
the approximation (6.28) appears quite adequate. These results suggest that 
the variation in the \, is closely related to the variation among the p(7), and 
that the largest A; will be about of the order of the largest of the p(z). Conse- 
quently, if the largest p(7) is small, even though there is large relative variation 
among the p(7), the variation of the A, will be sufficiently confined that the 
usual correction factor, (6.27), is appropriate. 

These conclusions are plausible on general grounds. It is sufficient to con- 
sider the case n= 2. Suppose that the most probable element leads to an under- 
estimate and that all other elements lead to overestimates. Then sampling 
without replacement is clearly advantageous relative to sampling with replace- 
ment, and the extent of the advantage decreases with a decrease in the value of 
the probability of the most probable element. If all probabilities are relatively 
small, then it becomes unlikely that the same element appears in both draws in 
sampling with replacement, so that there is but little relative advantage to 
sampling without replacement. 

The above type of formal analysis is pertinent to a problem posed by Horvitz 
and Thompson [5]. This is the problem of how to choose p(a, 8), for given 
p(8), so as to optimize the variance of the estimator 7, (6.20). It has been shown 
that the variances, for different sampling procedures, differ in the correction 
factor, and hence that for sampling procedures that have the characteristic 
specified above, the differences are relatively small. It follows further that no 
sampling procedure is uniformly best. For if such a procedure, leading to 
p*(a, 8), had uniformly minimum variance, then, for any other feasible system, 
(6.22) leads to the inequality 


Yp*(a, 8)Z(a)Z(8) < Lp(a, B)Z(a)Z(8) 


for all {Z(8)} such that }°Z(8)p(8) =0. Because of (6.24), this is equivalent to 
the requirement that the matrix 


q(a, 8) = ||p(a, 8) — p*(a, 8)|| (6.29) 


be positive definite. Since the diagonal elements of g(a, 8) are all zero, from 
(6.23), the sum of the characteristic roots of g(a, 8) is zero. Henze g(a, 8) cannot 


A 
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be positive definite, and no sampling system leading to a specified set of 
{p(8)} is uniformly best. 

Expressions for the variance of a and u*, (6.4) will be developed for the case 
n=2 only, (6.10) and (6.13). For this case, it appears that the finite population 
correction factor will ordinarily be comparable to those for the Horvitz- 
Thompson procedures. Hence there is to be expected but small differences in 
efficiency for the two types of procedures. This does not eliminate interest in 
the estimation procedures under discussion for two reasons. One is that in 
applying the Horvitz-Thompson procedure, the determination of the sampling 
‘procedure to yield the desired p(8) is somewhat cumbersome. This is particu- 
larly the case for applications of the type considered by Jessen [6]. The second 
reason is that in general there do not exist solutions to the sampling procedure 
problem to yield an arbitrarily specified set of probabilities {p(8)}. This was 
pointed out by Horvitz and Thompson with respect to their “sampling scheme 
I”, where p(8) is restricted to the range 


1 
~?4) n 


An expression for the variance of % can be developed for the case n =2 from 
the consideration 
var(a) = 3[var(u) + var(ue) | (6.30) 


since uw; and uw are uncorrelated. The variance of w is given, directly or from 
(6.22) with p(a@, 8) =0, as 


N 2 
var(m) =o? = >> — T,?. (6.31) 


The variance of wz may be developed by applying this equation to the condi- 
tional expectation 


B¥8, p(B) 
Ty 1 
= [1 — p(6;) Jo? (6.32) 
p(B1) 


to yield 


N N 
var(us) = (1 -> o* — [y(8) — Typ(8)]%. (6.33) 


B=1 


Hence the variance of @ is given by substituting (6.31) and (6.33) into (6.30): 


N o 1 N 
var(a) = [2 -> > [y(s) — Typ(8)P. (6.34) 
Bel 


Be 
Ve, 
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If this is expressed in a form exhibiting the “correction factor” there results 


var(a) = (1 — (6.35) 


where # is the weighted average of the p(8) given by the weights 
B 2 
1 p(8) 
p(s) + —; “at 
| 
~ p(B) 


The variance of u* is a multiple of var (7%). As can be seen from (6.14) and 
(6.16) the factor is a weighted average of 


| p(a) + p(B) 
2 — — p(s) 


Consequently a bound for var (u*) is given as 


E (; = -) | var(a) < var(u*) < E -) | var(%), (6.37) 


where q is the average of the two largest of the p(8), and q, is the average of the 
two smallest. Since the p(8) will ordinarily be small, this result shows that @ is 
very nearly as efficient as u* for the case n=2. In connection with (6.6), the 
result for random sampling, this suggests that the same will be the case for 
larger values of n. 


For sample sizes greater than two, it will suffice for many purposes to observe 
the inequality 


w(8) = (6.36) 


2 


var(a) < (6.38) 
n 
This follows from the lack of correlation of the u; and the non-increasing prop- 
erty of the sequence {var (u;) 4 Equation (6.33) implies that 
E{ (uny Ty)? | fi, 
= (1 — A)E{ (un — T,)?| Br, Bra}, (6.39) 


where h is a positive function of y(8;) and p(@;), i=1, - - - , n—1. Therefore 
there are positive constants k, and k such that 


var(Uny1) = (1 — kny1) var(un), 
var(tny1) < (1 — k)"o? (6.40) 


where as before o? is the variance of u. This leads to the inequality 


n nk 


(6.41) 
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If the constant k is fitted from the case n = 2, then for small n the approximation 


2 
var‘) (1 — (n — 15), (6.42) 


where is the correction factor given by (6.35) and (6.36), should ordinarily be 
adequate. This again shows that but small differences in efficiency are to be 
expected in comparison of @ with the sampling-estimation procedures con- 
sidered by Horvitz and Thompson. 

Finally, it may be noted that estimation procedures of the type considered in 
this paper apply to the sampling scheme in which the first m, say, of the sample 
elements are selected according to a general probability method and the last 
(n—m) elements selected by random sampling from the remaining population. 
This might be done as a hedge against a poor choice of selection probabilities 
p(8). Uncorrelated estimates u; can be computed according to the method of 
this section, i=1, - - - , m, and according to the methods discussed in section 4 
for7=m+1,---,n. In any event, 


1 
— du; 
n 


- 1 - 


provide unbiased estimators of u,, or T',, and of the variance of @. 
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CONFIDENCE INTERVALS FOR THE MEANS OF DEPENDENT, 
NORMALLY DISTRIBUTED VARIABLES 


JEAN DUNN 
University of California (Los Angeles) 


Several possible methods are presented for constructing confidence 
intervals for the means of normally distributed, dependent variables 
when nothing is known about the correlations. One, which uses the 
Student ¢ distribution, is found, when the degrees of freedom is not too 
small compared to the number of variables, to give intervals almost as 
short as can possibly be attained. Methods based on Hotelling’s 7 and 
on Scheffé’s confidence intervals for all linear contrasts are found to 
yield intervals appreciably longer than those using the ¢ distribution. 

The extent to which these intervals may be shortened when some 
knowledge of the correlation structure is available is suggested as a 
problem for investigation. 


1. INTRODUCTION 


N VARIOUS types of research, several different variables are measured from 

the same sample. The data obtained may be highly correlated, and usually 
the degree of correlation is unknown. 

In biological research, for example, growth data are often obtained, with 
measurements taken on the same sample of n individuals at k different times; 
the k measurements for each individual would be highly correlated. In medical 


research, the effect of a drug might be measured on n individuals at k different 
times; here again correlations would be present. The psychologist might meas- 
ure ” individuals’ responses to k different levels of a stimulus; a high degree of 
dependence might be anticipated. Economic time series would be another exam- 
ple of a situation where the assumption of independence would be unjustifiable. 

The research worker might wish to do various things with his data. If he is 
interested in linear contrasts among the means 


( Lio + ps where yi, we, , we denote 
2 means of the k variables , 
he would naturally estimate them using the same linear contrasts among the 
sample means 

— — etc., where 7; = 
If the variables are normally distributed and their variances are assumed to 
be equal, Henry Scheffé [5] and John W. Tukey [6] have each given methods 
for obtaining simultaneous confidence intervals for all such linear contrasts. 

On the other hand, the research worker might be interested in fitting a curve 

to his data. Paul G. Hoel [3] has given a method for estimating a regression 
curve and obtaining a confidence band for it which may be used for dependent 
variables, provided one assumes the existence of a polynomial regression curve 
of known degree. 
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In the present paper it is assumed that the experimentalist is interested in 
confidence intervals for the k separate means rather than in confidence inter- 
vals for linear contrasts or for a regression curve. This could be the case, for 
example, if he does not wish to make the rather laborious calculations to obtain 
the confidence band for a regression curve, or if he is unwilling to assume the 
existence of a regression curve of a known degree. He looks, therefore, for k 
separate confidence intervals, one for each mean, with the assurance that, with 
a high probability, each interval of the set contains its mean. 


2. NOTATION 


Let the k variables be y:, y2, - - - , ye, and let the measurements on the j-th 


individual in the sample be y:;, y2j, - , Yes, for j=1, 2, - - , n. Throughout 
this paper, it will be assumed that y:, yz, - - - , yx follow a multivariate normal 
distribution with means yi, ue, , we, Variances - and un- 
known covariances. 
Let , be the k sample means and let 
= (ys — 9)?/(n — 0), ¢=1,2,---,&k, 
j=l 


be the sample variances. 


3. CONFIDENCE INTERVALS WHEN THE VARIANCES ARE KNOWN 


As might be anticipated, the shortest intervals can be obtained for the k 
means of normally distributed variables if one knows the variances, 
oi, -- +, on%. Ina recent article [2], I showed, using a Bonferroni inequality, 
that regardless of the correlations among the variables, 


—< = y —< 


21-—a, (1) 


where c, is the 1 —(a/2k) point of the standard univariate normal distribution. 
Here P denotes the probability that all k events 


Gi — Ca <i < + 
Vn 


occur simultaneously. Thus, the joint confidence intervals for , ue, 
with confidence level 1 —a, are: 


Here cq is defined as in (1), and its values may be obtained for 1—a=.95 and 
for various values of k in the last row of Table 615. 

If the confidence level is chosen to be near unity, say .90 or larger, these con- 
fidence intervals are, for practical purposes, the shortest ones obtainable when 
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TABLE 615. VALUES OF 1—.05/2k POINT OF STUDENT t DISTRIBUTION 
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to 


one knows the variances but does not know the correlations. As a measure of 
length, the square root of the expected value of the squared half-length of the 
interval has been used. 

If the variables are known to be independent, then one can multiply probabil- 
ities very simply and instead of using the above confidence intervals, one would 
tend rather naturally to make intervals from 


Vn 


= i Yi Ca | = — a, 
y y a 


where c, is the 
1+ (1 — a)" 
2 


point of the normal distribution. Such intervals are the shortest possible ones 
for the means of k independent variables. In the situation where nothing is 
known about the correlations, any confidence intervals used must certainly be 
applicable for the case when variables are independent. Therefore, one cannot 
hope to find intervals for the general case any shorter than those given by 


where c, is the 


1+ (1 — 
2 


point of the univariate normal distribution. 

In my earlier paper [2] I conjectured that for the general case when nothing 
is known about the correlations, (3) may hold with the equality sign replaced 
by a greater than or equal sign. Then 


k 

Lo 1 2 3 4 5 6 7 8 9 10 15 | 20 | = 
v 
534° 
5 
10 
15 
20 
24 

40 
60 
120 

(3) 
| 
~. 

¥i £ Ce 
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+ 


could be used as intervals for y;, with c. the 
1+ (1 — 
2 


point of the univariate normal. I have proved this conjecture, however, only 
for k=2 and 3, and also for all k provided the correlations between the 7-th and 
j-th variables are of the special form b;b;, where 0<b;<1,i=1, 2,---,k. 

In the last row of Table 616, values of c. for 1—a=.95 are given for these 


TABLE 616. VALUES OF 95 PER CENT POINT OF STUDENTIZED 
MAXIMUM MODULUS 


k 
1 2 3 4 5 6 7 8 

v 

5 2.57 3.09 3.40 3.62 3.78 3.92 4.04 4.14 

10 2.23 2.61 2.83 2.98 3.10 3.19 3.28 3.35 

15 2.13 2.47 2.67 2.81 2.91 2.99 3.06 3.12 

20 2.09 2.41 2.59 2.72 2.82 2.90 2.97 3.02 

24 2.06 2.38 2.56 2.68 2.78 2.84 2.91 2.96 

30 2.04 2.35 2.52 2.64 2.73 2.80 2.86 2.91 

40 2.02 2.32 2.49 2.60 2.69 2.76 2.82 2.86 

60 2.00 2.29 2.46 2.56 2.65 2.72 2.77 2.82 
120 1.98 2.26 2.43 2.53 2.61 2.68 2.73 2.77 

~ 1.96 2.23 2.39 2.49 2.57 2.64 2.69 2.73 


shortest intervals. As mentioned in the last paragraph, they may be used only 
for k=2 and 3, or for higher k when the correlations are of the form b,b;. It 
may be noted that the intervals in Table 615 are nearly as short as those of 
Table 616. Since for Table 615 c. is the 1—a/2k point of the normal distribu- 
tion, and for Table 616 c, is the 


1+ (1 — a) 
2 


of the same distribution, the Bonferroni interval of Table 616 is almost as good 
as it is possible to achieve for 1 —a close to 1. 


4. CONFIDENCE INTERVALS WITH VARIANCES UNKNOWN 


If the variances are unknown, then the shortest intervals for which tables 
are likely to be available are similar to those in Section 3, with the tables of 
the Student ¢ distribution used in place of the normal distribution. 

The confidence intervals with confidence level 1—a are 

82 
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where ¢, is the 1—(a/2k) point of the Student ¢ distribution with n—1 degrees 
of freedom. For 1—a=.95, values of c, for various k and n are given in Table 
615. Here v=n—1. 

If equal variances are assumed, s,”, - - - , s,2 may be replaced in (4) by 


k 
8," = 8:°/k; 


then c, is the 1 —(a/2k) point of the Student ¢ distribution with k(n —1) degrees 
of freedom. When equal variances are assumed, then as in Section 3 slightly 
shorter intervals could be obtained if the variables were known to be inde- 
pendent. Again for k=2 and 3 and for any k with correlations b,b;, these can 
be used for dependent variables. 

The intervals are: 

Sp _ 8p 

fi + Ca + Ca 
Here c. is the 1—a@ point of the Studentized maximum modulus, as given by 
Pillai and Ramachandran [4]. Values of ca for 1—a=.95 are found in Table 
616; the table must be entered with »=n—1. 

Values of c. have been included in Table 616 for several values of k larger 
than 3. Since for larger k they may be used only if the correlations are of the 
form b,b;, they are of no help when nothing is known about the correlations, 
and have been included mainly to indicate how much room there is for im- 
jrovement. For larger k and small », there is an appreciable difference between 
the figures in the two tables. 

It is possible that the intervals of (5) can be shown to be usable for any cor- 
relation structure and for any k. Even though someone does succeed in estab- 
lishing them mathematically, however, the intervals of Table 615, using the 
Student ¢ tables have an advantage in requiring no assumption about equality 
of variances. 


(5) 


5. CONFIDENCE INTERVALS WITH VARIANCES UNKNOWN BUT EQUAL— 
F DISTRIBUTION 


The research worker may find, in attempting to use the intervals of Section 
4, that for his values of 1—a, k, and n, the Student ¢ tables available to him 
are inadequate. He may then consider intervals which are based on Henry 
Sheffé’s simultaneous confidence intervals for linear contrasts [5], and which 
use the F distribution. He must assume equal variances, however, and the inter- 
vals are considerably longer, becoming worse comparatively as k increases. 
They are 


Vn 
Here C2 = Vik Fa, Where F, is the 1 —a point of the F distribution with k and n—1 


degrees of freedom, and s,? is the pooled estimate of the variance. 
For 1—a=.95, values of ca may be found in Table 619a. Here v=n—1. 


tite (6) 


t 
pe 


618 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1959 


6. CONFIDENCE INTERVALS WITH VARIANCES UNKNOWN—HOTELLING’S 7 
When n is larger than k, by circumscribing a rectangular parallelopiped 
around the confidence ellipsoid one can use Hotelling’s 7 distribution to obtain 
a set of confidence intervals [2]. Since 
k(n — 1) 
= ———F 
n—k 


T? 


where F has k and n—k degrees of freedom, it is convenient to use tables of 
the F distribution. The intervals are: 


Sk 


Cu 
Vn 


fi + 


+ ie + Ce (7) 


k(n — 1) 


where F is the 1—a point of the F distribution with k and n—k degrees of free- 
dom. For 1—a=.95, values of c. are given in Table 619b. These intervals are 
seen to be longer than those from Table 619a; their advantages are that tables 
are readily available and that no assumption about equality of variances need 
be made. 


Jn 


Here 


7. A NUMERICAL EXAMPLE 


As an illustration of the use of these confidence intervals, I use some observa- 
tions on the growth of rats given in a paper by Box [1]. The weights of 10 rats 
given a certain treatment (thiouracil added to their drinking water) were re- 
corded weekly 5 times (yo, ¥1, Y2, Ys, Ys). The means, variances, and standard 
deviations are as follows: 


It would be unreasonable to expect the variances of the five different times 
to be equal, so therefore one must use either the intervals of Section 4 (Table 
615) or those of Section 6 (Table 619b). Choosing the former, entering the table 
with k=5, v=9, and interpolating with the reciprocals of the degrees of free- 
dom, the following intervals are obtained: 

49.9to 59.5 grams 
68.1 to 84.5 grams 
87.0 to 104.6 grams 
98.1 to 118.3 grams 
112.4 to 135.6 grams 


4 

Vn 
a 

4 

i 

| 
| Yo | Ye | | 

Mean 54.7 76.3 95.8 108 .2 124.0 ae 
22.01 62.68 72.18 95.07 126.67 
8; 4.692 | 7.917 8.496 9.750 11.254 
10 1.48 | 2.50 2.69 3.08 3.56 
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TABLE 619a. VALUES OF VkF-%(k, v) 


wwww ow 


TABLE 619b. VALUES 


Plotting these intervals gives an idea of the location of the growth curve. 

As an illustration of a slightly different use for the intervals of this paper, I 
use more data from Box [1]. A control group of 10 rats was also weighed, and 
it would be of interest to look at differences in the weekly gains in weight be- 
tween the thiouracil group and the control group, rather than at the weights 
or at differences in weights themselves. 

First of all, I will make the assumption that for the i-th week, the variances 
in the gains in weight are equal in the control group and in the thiouracil group, 
say o,?. It would seem fairly reasonable also to assume that all four o,* are 
equal, but since to use Table 615 it is unnecessary to assume equal variances, 
there is no particular reason for making this assumption. 

The mean gains in weight, their differences, and the calculation of the 


619 

k | 

He! 1 2 3 4 | 5 6 7 8 

ae 5 2.57 | 3.40 | 4.03 .56 | 5.02 | 5.45 | 5.84 | 6.21 
ap 10 2.23 | 2.86 | 3.34 .73 | 4.08 | 4.40 | 4.69 | 4.96 

pis 15 2.13 | 2.71 | 3.14 .50 | 3.81 | 4.09 | 4.36 | 4.60 

ee 20 2.09 | 2.64 | 3.05 39 | 3.68 | 3.95 | 4.19 | 4.43 

Bel: 24 2.06 | 2.61 | 3.00 .34 | 3.62 | 3.88 | 4.12 | 4.34 

a 30 2.04 | 2.58 | 2.96 .28 | 3.56 | 3.81 | 4.04 | 4.26 

ae 40 2.02 | 2.54 | 2.92 .23 | 3.50 | 3.75 | 3.97 | 4.18 
3 60 2.00 | 2.51 | 2.88 18 | 3.44 | 3.67 | 3.90 | 4.10 

ee 120 1.98 | 2.48 | 2.84 13 | 3.38 | 3.62 | 3.83 | 4.02 
mS Ps 1.96 | 2.45 | 2.79 .08 | 3.32 | 3.55 | 3.75 | 3.94 

F-*(k, »—k+1) 

»—k+1 

Bo 1 2 3 4 5 6 7 8 

“ae 5 2.57 | 4.17 | 6.81 | 13.86 | 75.88 | — | Pee 

on 10 2.23 | 3.08 | 3.91 | 4.85 | 6.05 | 7.71 | 10.32 | 15.36 

ae 15 2.13 | 2.83 | 3.44 | 4.04 | 4.67 | 5.38 | 6.20 | 7.18 

rt 20 2.09 | 2.72 | 3.25 | 3.74 | 4.22°| 4.72 | 5.26 | 5.84 

os 24 2.06 | 2.67 | 3.16 | 3.61 | 4.03 | 4.46 | 4.90 | 5.37 

ne) 30 2.05 | 2.63 | 3.08 | 3.48 | 3.86 | 4.24 | 4.60 | 4.98 
= 40 2.02 | 2.58 | 3.00 | 3.37 | 3.71 | 4.03 | 4.34 | 4.66 

6 60 2.00 | 2.53 | 2.93 | 3.27 | 3.57 | 3.85 | 4.13 | 4.38 

os 120 1.98 | 2.49 | 2.86 | 3.17 | 3.44 | 3.70 | 3.93 | 4.15 
at 0 1.96 | 2.45 | 2.79 | 3.08 | 3.32 | 3.55 | 3.75 | 3.94 
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variances is as follows, where yo is the initial weight and y;* is the gain in weight 
during the i-th week: 


Mean: 
Control 
Thiouracil 
Difference =d; 


10 
Sum of Squares: — 91)? 
Control 210.5 
Thiouracil 260.4 
Total 470.9 
Estimate of Variance of Gain in Weight: s,;*? 
Control 23.39 
Thiouracil 28.93 
Pooled 26.16 
Estimate of Variance of Mean Gain in Weight: 
s;*?/10 
Control 2.339 
Thiouracil 2.893 
Pooled 2.616 
Estimate of Variance of dj: 
5.232 
Estimate of Standard Deviation of d;: 


2.29 


Entering Table 615 with k=4 and »=18, and interpolating with reciprocals 
of degrees of freedom, the confidence intervals are: 
2.9+(2.78)(2.29) or —3.5 to 9.3 grams 
8.0+(2.78)(2.01) or 2.4 to 13.6 grams 
11.7+(2.78)(2.52) or 4.7 to 18.7 grams 
14.7+(2.78)(2.74) or 7.1 to 22.3 grams 


Thus one would conclude that after the first week, rats given thiouracil gain 
less on the average each week than rats not given thiouracil; during the first 
week the evidence is inconclusive as to the direction of the difference. 


8. DISCUSSION 


All the intervals of this paper are of bounded confidence level; that is, the 
probability of coverage is greater than or equal to 1—a rather than equal to 
1—a. This is not surprising, since the intervals must cover the true means no 
matter what correlations exist among the variables. For a given correlation 
structure, whether the probability of coverage is exactly 1—a or whether it is 
larger than 1—a, however, is of mainly academic interest; the research worker 
is interested in knowing that the probability of coverage is at least 1—a and 
also in having short intervals. 

When the variances are known and when the correlations are all zero, then 


54.0 24.5 27.5 24.1 30.5 a 
54.7 21.6 19.5 12.4 15.8 —- 
-2.9 8.0 11.7 14.7 

202.5 | 310.9 | 258.5 at 

160.5 | 262.4 | 419.6 oe 

363.0 | 573.3. | 678.1 

22.50 | 34.54 | 28.72 - 

17.83 | 29.16 | 46.62 aa 

20.16 | 31.85 | 37.67 cok 

2.250| 3.454 | 2.872 

1.783 | 2.916 | 4.662 

2.016 | 3.185 | 3.767 

4.033 | 6.370| 7.534 

2.01 2.52 2.74 


CONFIDENCE INTERVALS 621 


the probability of coverage using the intervals (2) is exceedingly close to 1—a. 
This is evidenced by the fact that the last row in Table 615 is almost as short 
as the last row of Table 616, and the statement is true for larger values of 
k also. 

When all the correlations are equal to +1, then the actual probability of 
coverage can be obtained by using the univariate normal tables. For example, 
when k=8, v=, and a=.05, cz=2.74, and the probability of coverage is 
.9969. Thus, for k =8 and for any set of correlations, the probability of coverage 
lies somewhere between .95 and .9969. 

Of more direct interest to the research worker is the question of how much 
he may hope to shorten his confidence intervals if he knows something about 
the correlations. When k=8 and v= ~, with the correlations all +1, then 2.74 
can of course be shortened to 1.96; with the correlations all zero, the intervals 
can be shortened from 2.74 only to 2.73. The problem of the extent to which 
the confidence intervals may be shortened if the research worker is willing to 
assume that all the correlations are greater than some positive number, is an 
interesting one which should be worth pursuing. When k=2, and v= ~, tables 
for the bivariate normal distribution show that the 2.23 which must be used 
for p=0 and a 95% level may be shortened by only a very small amount for 
p known to be as high as .8. Whether the same situation may exist in higher 
dimensions and for » finite is an open question. 
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A BASIS FOR THE SELEC'’!0N OF A RESPONSE 
SURFACE DESIGN* 


G. E. P. Box 
Princeton University 


AND 
Norman R, DraPer 
University of North Carolina and Imperial Chemical Industries 


The general problem is considered of choosing a design such that 

(a) the polynomial =f(&, &, +, &) in the continuous vari- 
ables =(£:, &, °° +, &) fitted by the method of least squares 
most closely represents the true function g(&, &&) over 
some “region of interest” R in the — space, no restrictions being 
introduced that the experimental points should necessarily lie 
inside R; and 

(b) subject to satisfaction of (a), there is a high chance that in- 

adequacy of f(€) to represent g(&) will be detected. 

When the observations are subject to error, discrepancies between 
the fitted polynomial and the true function occur: 

(i) due to sampling error (called here “variance error”), and 

(ii) due to the inadequacy of the polynomial f(€) exactly to repre- 

sent g(&) (called here “bias error”). 

To meet requirement (a) the design is selected so as to minimize J, 
the expected mean square error averaged over the region R. J contains 
two components, one associated entirely with variance error and the 
other associated entirely with bias error. 

There is a class of designs which satisfy requirement (a). To meet 
requirement (b) we select from this class a subclass for which the 
“non-centrality term” in the expectation of the residual sum of squares 
in the analysis of variance is large. This leads to a sensitive test of 
goodness of fit. 

In this paper the theory is applied to the particular case where f(&) 
is a polynomial of first degree and g(€) a polynomial of second degree; 
that is, the experimenter is hopefully fitting a first degree equation aoa) 
over the region R in the circumstances where the true function is really tie 
quadratic. The somewhat unexpected conclusion is reached that, at 
least in the cases considered, the optimal design in typical situations : 
in which both variance and bias occur is very nearly the same as would 
be obtained if variance were ignored completely and the experiment de- 
signed so as to minimize bias alone. 

Particular examples of the class of optimal designs derived are frac- 
tional by replicated two-level factorial designs (in which no two-factor 
interaction is confounded with the main effect) with added center 
points. 

It is proved (Appendix 1) that if a polynomial of any degree dj is 
fitted by the method of least squares over any region of interest F in the 
k variables, when the true function is a polynomial of any degree d2 > d,, 
then the bias averaged over R is minimized for all values of the co- 
efficients of neglected terms, by making the moments of order di+d:z : 
and less of the design points equal to the corresponding moments of a 
uniform distribution over R. 


* Paper read at the 31st Session of the International Statistical Institute in Brussels 6th Sept. 1958. Material | 
originally presented at the I.M.S. Summer Statistical Institute, July 1957, and prepared as Technical Repori No. Ps 
23, August 1958 in connection with research sponsored by the Office of Ordnance Research, U.S. Army; Statistical 
Techniques Research Group, Princeton University, Contract No. DA 36-034-ORD 2297. 
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1. INTRODUCTION 


Sse a functional relationship 


exists between a response » and k continuous variables £1, &, - - - , &. Suppose 
further that in order to elucidate certain aspects of this relationship, measure- 
ments of » are to be made for each of N combinations of levels of the variables 


= (fin, feu), (u 1, 2, N) (2) 


1.1 The Problem. The problem of experimental design here considered is the 
choice of the design matrix D of N rows and k columns. This matrix specifies 
the levels of all the factors for all the measurements to be made and its u-th 
row is given by the elements of &,’ in (2). 

The considerations that influence the choice of D are different in different 
circumstances. Two particular cases that arise are: 


(i) when, the form of the true functional relationship 7 =g(€, ®) (which is of 
course not necessarily linear in the parameters 6 or in the variables &) 
being assumed known, the object is to estimate the parameters 6,, - - - , 0p; 
and 
when, the form of the true functional relationship being unknown, the 
object is to approximate within a given region R of the k-dimensional 
space of the variables, the function g(&, 6) by some graduating function 
S , Bi, - Br). The function f(E, 8) would often be a 
polynomial in which case § would be the 1X1 vector of polynomial 
coefficients. 


The two different objectives lead to different types of design. Designs ap- 
propriate for (i) may be called designs for estimating parameters; designs ap- 
propriate for (ii) may be called designs for exploring a response surface (the 
surface involved being that defined by the function (1) in the (k+1)-dimen- 
sional space of the response 7, and the variables £,, &, - - - , &. The first problem 
has been discussed [11, 9, and 7]. This paper, however, is specifically concerned 
with the second problem and not with the first and it is supposed that the 
functional form g(£, 6) is unknown and is to be graduated locally by a polynomial 
f(E, 8) of degree d, in &. In accordance with an already established notation 
[8, 6] a design suitable for fitting a polynomial of degree d; is called a design 
of order 

Now, in practice, the nature of the variables whose levels are represented 
by £,, will change from one application of a design to another. In one case, for 
example, &; may refer to a temperature reading and in another to the dosage of 
a drug. Therefore, it is useful to define the general design in terms of “standard- 
ized” variables x; which in any particular application are related linearly to the 
by a transformation 

Ein £;(0) 
Fin 3, (3) 


In what follows we shall choose £;(0) to be the mean of the £;, so that 


623 
i 
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N 
> liu = 0. (4) 
u=l 

The scale factor S;is acu .c. ant relating the standardized levels z; 


in the general design to the leveis us tne actual variables £; to be used in a 
particular application. 

1.2. The Requirements. In a recent paper [6], it was suggested that, when 
the function was to be graduated by a polynomial, suitable requirements for 
a response surface design were as follows: 


(a) The design should allow the graduating polynomial of chosen degree d, 
to represent the true function as well as possible within the region of 
interest; 

(b) It should allow a check to be made on the representational adequacy of 
the polynomial; 

(c) It should not contain an excessively large number of experimental points; 

(d) It should lend itself to “blocking”; 

(e) It should form a nucleus from which a satisfactory design of higher order 
could be built in case the polynomial of degree d; proved representation- 
ally inadequate. 


1.3 The Necessity for Considering Bias as Well as Variance. Most approaches 
to the theory of experimental design have been concerned only with the 
errors arising from “sampling” variation. The graduating function is assumed 
to be capable of providing a perfect representation and the expectation of a 
fitted value #(x) is supposed equal to n(x) the true value. In practice, of course, 
this specification is inadequate because a graduating function such as a poly- 
nomial will always fail, to some extent, to represent the true function. In 
reality therefore there are not one, but two, possible sources of discrepancy 
between the true function n(x) and the fitted graduating function #(x). The 
first occurs because of sampling error and the second because of the inadequacy 
of the graduating function. We refer to the first as the “variance error” and 
the second as the “bias error.” 

Derivation of statistical designs as if “variance error” were the only cause of 
discrepancy has often led to conclusions which are at odds with the experi- 
menter’s natural intuition. Consideration of why this happens helps us to see 
how we ought to proceed. Usually there will be a large “operability region,” 
O, of unknown, or vaguely known, extent within which it is possible to carry 
out experiments and within this, at a given stage of experimentation, a smaller 
“region of immediate interest,” R. In this paper it is assumed that # is entirely 
within O and the boundary of O is never reached by any experimental poirt. 
Within the region of immediate interest R, the experimenter may feel it is 
reasonable to represent the response function by, for example, a polynomial of 
first or second degree, although he may know that such a representation would 
be quite inadequate over the whole operability region O. 

Assuming a particular model, the variance of #(x) at some point x will 
normally decrease as the size of the design is increased, so that if variance error 
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is treated as the only kind of discrepancy we are led to the conclusion that in 
order to obtain a good representation over R we ought to take as large a design 
as possible covering the whole operability region O. This result is at odds with 
common sense and is only reached because we ignore the decreased ability of 
the simple graduating function to represent the real relationship as wider and 
wider regions of the space of the variables are considered. It may be argued 
that to avoid this difficulty one could simply choose the design points to “cover” 
the limited region R. Against this however is the consideration that it might 
turn out that, even where bias was taken into account, to obtain a close fit 
within RF it was best to locate some points outside R. Alternatively it might be 
best to confine all the design points to a region much smaller than R. 


2. METHOD 


Clearly what we require is some way in which the apparent added precision 
obtainable by making the design larger may be balanced against the loss of 
representational accuracy. So far we have mentioned only the choice of size of 
the design. In practice, the values of other quantities (such as \, for a second 
order rotatable design [6]), which determine the distribution of the design 
points in the space of the variables, have to be decided upon and again similar 
questions arise. 

The object of the present paper is to consider a general theory which will 
meet the first two requirements (a) and (b) listed above when both “variance 
error” and “bias error” are taken into account. To develop this theory we use 
a principle which might with profit be adopted more often in statistical in- 
vestigations. This is that, rather than suppose, as is usually done, that the 
assumed model is always correct, we suppose that it is always to some extent 
incorrect. The general theory which is developed is applied to the particular 
problem of choosing a first order design, that is a design which is appropriate 
when the polynomial fitted is of first degree in 2, - ++ , 2. 

2.1 Interpretation of Requirement (a). We shall interpret requirement (a), 
that the design should allow a graduating polynomial of specific degree d, to 
represent the true function as well as possible within the region of interest R, 
in the following way. 

We suppose that, although it is assumed that within R a polynomial of degree 
d; will be adequate, i in fact the true function is a polynomial of higher degree 
d,>d; and that this polynomial of higher degree provides an exact representa- 
tion within the whole operability region O. Let (x) be the value of the esti- 
mated response at the point x where this value is obtained by the least squares 
fitting of the polynomial of degree d; using a design containing N points. Let 
n(x) be the true response at this point given exactly by the polynomial of higher 
degree d, and let o? be the experimental error variance. Then we should like to 
choose the design so as to minimize 


ee 

ts 

ce 

- 
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where dx=d2, dx2 - - - dx,. Thus J is the mean squared deviation from the 
true response, averaged over the region R and normalized with respect to the 
number of observations and the variance. To illustrate the division into 
“variance error” and “bias error,” (5) can be written 


J=V+B (6) 


NQ NQ 
f V [9(x) Jax + — f — n(x) (7) 
o R o R 


f dx. 
R 


Tt will be noted that the expression V is the variance function of the reference 
[6 | averaged over the region R. A similar criterion has been employed in recent 
work by David and Arens [10], and by Folks [12]. 

2.2 Interpretation of Requirement (b). We shall interpret requirement (b), 
that the design should allow a check to be made on the representational ac- 
curacy of the assumed polynomial in the following way. 

We suppose that a test for lack of fit is to be made by the use of an analysis 
of variance in which the residual sum of squares 


where 


N 
Se = (Gu — yu)? (8) 
is compared with the experimental error variance. This test may involve the 
comparison of Sp either with a prior value o? of the experimental error variance, 
supposed to be known exactly, or with some independent estimate s?. In either 
case, a parameter which determines the power of tae test for goodness of fit 
will be the quantity 
N 


— mu]? = E(Sk) — v0? 9) 

u=1 
where v is the number of degrees of freedom on which the residual sum of 
squares is based. While our ultimate object should be to make the power of 
the test as large as possible, in any particular instance in which rv is assumed 
fixed, an intermediate objective will be to make the expectation of Sp large. 

We shall interpret requirement (b) therefore as implying that the design 

should be chosen so as to make E(Sz) large. It seems reasonable to regard 
requirement (a) as being of major importance so that in practice we shall 
proceed by first attempting to find the class of designs which minimizes J in 
(5) and then attempting to satisfy criterion (b) by selecting from this class, a 
sub-class which makes large the expected value of Sp in (8). 


3, APPLICATION : CHOOSING A DESIGN FOR FITTING A STRAIGHT LINE 


We first illustrate the theory in the simple case d;=1, d,.=2, k=1. Here the 
situation is that a relationship 


git 
: 3 
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n = g(&) (10) 


is supposed to exist between the response n and a variable ¢ such as tempera- 
ture. The exact nature of the function g(¢) is unknown but the experimenter 
feels that he can usefully graduate this relationship by a linear equation in 
over a limited region R within O extending from ¢’ to &’’. We assume that over 
some wider operability region O it can in fact be represented exactly by a 
quadratic equation in &. For simplicity and without loss of generality we make 
the transformation 
” 

ay 
so that in terms of x the region R extends from —1 to +1. Thus the fitted linear 
expression intended to approximate the true function over the region R is 


G(x) = bo + biz (12) 
while the true relationship over the whole region O is assumed to be 


n(x) = Bo + Bit + Bux? (13) 


We shall assume in what follows that 


N 
= 0, 
u=1 


which implies that the experimental design is centrally located in the region 
of interest R. We then apply our theory to decide what is the best distribution 
of the experimental points subject only to this restriction. 

3.1 Requirement (a); true model quadratic. Denote the second and third 
moments of the design points respectively by 


N 
= (14) 


u=l 


u=1 


v-—f jae / “ (16) 


1 
(1 + =] +; (17) 
2/74 


Then 


Now with (x) given by the quadratic model ( 13) the expected values of bo 
and }, in the fitted linear equation (12) are 


‘se 

i 

e = [11] 
: 
peu 
N 

i 
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E(bo) = Bot cBu (18) 


[111] 
E(b:) = Bi + (19) 


Therefore 
E$(x) = E(bo) + E(bi)z 


[111] 
= Bo + Bir + > 


whence, with ay? = 


1 1 
{ E9(x) — n(x) }%dx 
-1 


1 


5 3c? 
Combining (17) and (23) we obtain 
J=V+B 


1 2e [111]? 
= E =| — +—+ (24) 


3c 5 3c? 


The quantity V in the first bracket is the contribution from sampling error and 
the remaining quantity B is the contribution from bias. From (24) it is seen 
immediately that whatever the values of c and a, the expression is minimized 
for changes in [111] when this third moment is zero. In the subsequent dis- 
cussion therefore we suppose that [111] =0. 

With [111] set equal to zero 


and the only design characteristic contained in this expression is c. It will be 
noted that while the part V does not contain any of the parameters $, the part 
B, which expresses the integrated bias, contains ay,? = N By?o~? a “standardized” 
measure of the quadratic curvature. The value of ¢c which minimizes the whole 
expression J depends, therefore, on the size of a’. 

Now the sampling error in # is proportional to ¢/+/N so that the quantity 


an = Bu/(c/VN) (26) 


is a measure of the ratio of the quadratic curvature to the sampling error. When 
a, is very small, the curvature is very small compared with the sampling error, 
and the optimum c tends to infinity. On the other hand, whe: a, tends to in- 
finity, the curvature is very large compared with the sampling error and c 
tends to the value }. 
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q 
(20) 
B= (21) 
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== au (22) 
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The quantity +/c is the root mean square deviation of the experimental 
points from the origin. It is a convenient measure of the spread of the design 
points which, if the z’s were random variables, would correspond to the 
standard deviation o, of z. 

In the case where there is no experimental error and all the discrepancy arises 
from bias, that is from the inadequacy of the linear model, we are instructed 
therefore to choose the z, so that the root mean square deviation 


(27) 


To summarize: if we believed implicitly in the assumption of linearity over 
an indefinitely wide region then we should minimize V alone, which would lead 
us to allow the bounds of the design to extend as far as possible. At the other 
extreme, if with R extending from —@ to +6 we knew that errors which occurred 
in the observations were negligible but we were doubtful about the assumption 
of linearity, then we should minimize B alone, which would lead us to limit the 
range of the observations so that the root mean square deviation +/c=6.589. 

It will be seen that the two extreme cases lead to widely different conclusions 
about the optimal choice of the spread of the design. To make further progress 
we must obtain some idea of what might be a “typical value” for ay in the 
usual situation where variance and bias both occur. Now it might be ex- 
pected that if the experimenter were graduating a function by a polynomial 
over a particular limited region R, he would try to choose the degree of poly- 
nomial so that, over the region considered, the average size of error arising from 
the bias part was at least no larger than the average size of error arising from 
the variance part and, in a typical case, things might perhaps be arranged so 
that these contributions were about equal in magnitude. To see what this 
would imply for the present example we write (25) in the form 


J = V(c) + an?B(e). (28) 


For every value of ay there will exist a corresponding minimizing value of c and 
corresponding values for V = V(c) and B=ay,*B(c). We shall proceed by choos- 
ing c from these minimizing values so that the value of a, makes V=B. 
Now in general we can choose c from the minimizing values so that the value 
of a; makes V=gB where g is any desired positive constant by minimizing the 
product {V(c)}*B(e). 
For, let c* be the value of c that satisfies 


a 
— [{V(c)}*B(c)] = 0, 
0c 


that is, at c=c*, 
+ {V(c)}*BY(c) = 0 
Provided the product g{ V(c) }*-!B(c)B’(c) #0, we have 
1 V(c*) V’(c) 


4 

1 
= — = 0.58 

(29) 
4 
(30) 

(31) 
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Thus equation (29) provides the values c* of c, and a* of a such that 


a 
(a) V'(c) + (c) = (V + B) = 0 (32) 


(b) V(c) — gan**B(c) = V — gB = 0 (33) 


Putting g=1 we find that the optimal values of ¥/c when V =B is 0.62, when 
the value of ay is 4.49. This compares with the value \/c=0.58 when variance 
is completely ignored. 

We are led therefore to a somewhat remarkable conclusion. This is that the 
optimal design in a “typical” situation in which the influence of bias and 
variance are equal is very nearly the same as that obtained when variance is 
ignored completely and the experiment is designed to minimize bias alone. 

It is admitted that the concept, that the experimenter would typically ar- 
range matters so that the average errors in the predicted response due to 
sampling variation and to bias were about the same, is not very precise and 
in practice considerable deviations from it would be expected to occur. It is 
seen from Table 630 however, that even in the case where the variance contri- 


TABLE 630. OPTIMAL VALUES FOR Vc WITH ASSOCIATED 
VALUES OF au 


(variance contribution completely dominant) 
0.72 1.82 (variance contribution four times that of bias) 
0.62 4.49 (variance and bias contributions equal) 

(bias contribution completely dominant). 


bution is four times the bias contribution the resulting root mean square value 
Vc is very close to the value obtained when variance is ignored completely and 
entirely different from the value obtained when bias is ignored. It will be seen 
in the following sections that similar results are obtainable for the case of k 
variables. Almost all previous investigators (notable exceptions being Hotelling 
[14], and more recently David and Arens [10], and Folks [12]), have been 
principally concerned with the minimization of variance. The recent work 
raises the possibility that this preoccupation with variance is mistaken and, if 
in some design problems a simplification of the situation had to be made, it 
might be better to ignore sampling variation rather than to ignore bias. 

3.2 The Effect of Bias When the True Model Is a General Polynomial. We 
have seen that the situation where all the discrepancy arises from bias may 
be of considerable importance since, in the case studied above, it approximates 
to the situation in which both variance and bias occur. It is of some interest 
therefore to consider the case where a linear function is fitted and all the dis- 
crepancy arises from bias but the true function is not necessarily quadratic but 
is a polynomial of any degree d, whatever. It appears as a special case of a 
theorem proved in Appendix 1 that the integral B which measures the average 
error arising from bias alone is minimized when all the moments of the design 


TERR 
\ 
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up to that of order (d,.+1) are equal to the moments of a uniform distribution 
over the region R. For example with the true model quadratic, all moments up 
to order 3 must be the same as those of a uniform distribution over the range 
[—1, 1]. This means that +c the root mean square deviation of the design 
points from the center of the region must be equal to 0.58 and the third moment 
[111] to zero: a result obtained already. In general, when the true model is a 
polynomial of degree d the i-th moment 


und (¢+1)-' «even 
This is intuitively a very reasonable result; in particular it implies that if bias 
only had to be considered and nothing whatever were known of the nature of the 
function n(x) except that it could be represented by some polynomial having 
an indefinitely large number of terms, we would do best by spreading the design 
points evenly over the region R. It is thought that a completely general poly- 
nomial in which high order terms are given the same weight as lower order 
terms is not a particularly realistic model. Usually it could be safely assumed 
that the function would have smoothness properties which would place greater 
emphasis on lower order terms so that, except in the case of complete ignorance 
as to the nature of the function, even spacing of the design points is not neces- 
sarily indicated as the most desirable solution, even for the case where bias is 
completely dominant. 
3.3 Requirement (b); True Model Quadratic. Having selected the best value 
of c for the design by use of criterion (a), we introduce criterion (b). 
To satisfy this criterion we must make large 


(34) 


N 
o~*E(Sr) — = [E(9.) — nu (35) 


u=1 


For this example of the fitting of a straight line with the true function quadratic 
we obtain, by use of (12), (13) and (35), 


[ana]? 


Now criterion (a) requires that [111] =0. It is interesting to note that criterion 
(b) requires this independently; for in order that (36) be maximized [111] must 
be equal to zero. We see then from (36) that, with +/c set equal to some suitable 
value decided by criterion (a) and [111] set equal to zero, criterion (b) adi- 
tionally requires that the fourth moment coefficient should be made large. 

It can be shown that the design of this kind that maximizes the fourth 
moment is obtained by placing (N —2) of the N points at the center of the in- 
terval and the remaining two points symmetrically about this center and equi- 
distant from it. Intuitively such an arrangement is not particularly acceptable. 
The reason can be seen if we remember that in our example it has been assumed 
that the true function is eractly represented by a quadratic polynomial. If this 
were not the case, and in fact a cubic term had to be included, then from 
Section 3.2, minimization of bias alone would require that the fourth moment 
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[1111] should be 4. This implies that the measure of “kurtosis” [1111]/c? for 
the design points should be 1.8 —a small value. The practical implication is 
that to get good detectability of quadratic departure from the assumed model, 
the design should be chosen so that the fourth moment was not small but the 
value should not be taken so large as to cause serious cubic bias in B. The pre- 
cise value chosen will depend on one’s relative anxiety to get good graduation 
on the one hand and good detectability of quadratic lack of fit on the other. 
The conclusions to be drawn from the application of criteria (a) and (b) to 
this simple case therefore are as follows. The design should be selected so that: 


(i) the third moment of the distribution of the design points is zero; 

(ii) the size of the design is such that the root mean square distance of the 
design points from the origin is approximately .60, with 2@ the width of 
the region over which a linear approximation is required. (This is very 
close to the value which would be appropriate if there were no experi- 
mental error and is completely contrary to the conclusion obtained when 
bias is ignored.) 

(iii) In order that a quadratic tendency in the true model can be readily 
detected, the fourth moment of the design should not be small. 


4. APPLICATION: CHOOSING A.DESIGN FOR FITTING A PLANE 


Suppose now that there are k variables £1, f, - - - , &, that the nature of the 
true relationship »=g(, @) is unknown but the experimenter believes it can 
usefully be graduated by a linear relationship over some region R of immediate 
interest. 

The fitted function is then 


4 = bo biz; + boxe (36) 


or, in matrix notation, 


j= xi by (37) 
where 
by’ (bo, bi, b.) 
x,’ (1, » Tk) 


The true relationship which applies over the whole operability region O is 
assumed to be some polynomial function which, for the moment, we will sup- 
pose is of some unspecified degree d2 in the k variables. 


n = Bo + Biti + + Bete + Buti? + - + Bears? 
+ Brtite + + + Bits? + (38) 
or, in matrix notation, 
n = + (39) 
where 


(Bo, Bi, Bx) 


x,’ = (1, %) 


Wie 
fi 
ke 
Ag 
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Be’ = (Bu, Boo, , Bex; Biz, Bis, » Bin, 


and each of the vectors 62’ and x,’ contains p terms, where 


dh +k 
= —(k+1 


We have from (7) 


N 
Ely(x) — 9(x) dx = V +B 
o R 


V = vo f xy’ xidx 
R 


N 
B= [A’x: — x2] — x2’ ]Godx 


where 
X = xiv } 
is a (k+1)XN matrix with xy’ =(1, Tau, * Lew), 
X,’ [xo1, Xen | 
isa pXN matrix with 
and 
A = (4 


is the (k+1)Xp “alias matrix.” This last matrix (for example, [8]) has, for 
its elements, quantities which measure the extent to which thee stimates b, 
are biased by higher order coefficients in accordance with the equation 


E(b:) = 1 + Abe. (46) 


Making the necessary substitutions we obtain 


k k 


i=0 R 


= f nn/dx) AQ, — 2a,’ (f AQ, 
R R 
+ ( f 
R 


| 
with 
and 
42 
B) 
(48) 
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where 


Oe 


a//N 
and are the elements of the matrix = 

We note in passing that, if we were concerned only with minimizing the 
average squared bias B, then as a special case of the result of Appendix 1 
whatever the nature of the region R and whatever the degree d2 of the polynomial 
describing the true response, for all values of 6: the bias term B is minimized 
when all the moments of the design up to order d,+1 are made equal to the 
moments of a uniform distribution over R. This result has been still further 
generalized to include weighting functions, in unpublished work by C. L. 
Mallows. In particular this implies, as for a single variable, that, if only bias 
had to be considered and if the rather unrealistic assumption were made that 
nothing whatever was known of the nature of the function over the region R, 
then we should do best by spreading the points evenly over R. 

4.1 Choice of the Region R. The specific results we obtain, whether from 
minimizing V, B or the whole integral J, will of course depend on how we 
define the region R. In the present paper we shall suppose that considerations 
of strategy (for example [1]) dictate the sequential exploration of subregions R 
entirely contained within O. Such a strategy is often appropriate, for example, 
in experiments designed to find and to explore a region in which some response 
or responses have optimal values (for example [8]). To give satisfactory ex- 
pression to the experimenter’s desire to use designs which symmetrically 
generate information, it seems reasonable to choose R to be a symmetric region 
in the coordinate system currently believed by the experimenter to be most ap- 
propriate. Of course the experimenter’s ideas as to what is the best coordinate 
system in which to work will almost certainly change as the investigation pro- 
ceeds. For example suppose he began his investigation with a simple 2? factorial 
in variables x; and x2. This would seem to imply that he currently expected 
that the response surface could be conveniently represented in an orthogonal 
coordinate system (21, 22) w:th the scales of measurement for the two variables 
proportional to the step size in the factorial design. In fact (and unknown at 
this time to him) when so represented the response surface might be highly un- 
symmetrical; for example it might have a ridge-like appearance. In such a case, 
as he proceeded iteratively from one group of experiments to another, the in- 
formation built up about the response surface would probably lead him by a 
process such as is already described [4] to employ, in later stages, transforma- 
tions and changes of scales and metrics in the variables in terms of which the 
response surface could be more symmetrically and simply described. At any 
given stage he would work with that coordinate system which his experience 
had so far led him to believe would provide the simplest and most symmetric 
representation of the response surface. Later experience would usually show 
that his ideas were capable of improvement and would lead to modifications. 

We shall try to select designs therefore which generate information sym- 
metrically in that coordinate system currently thought to be best and we will 
interpret “symmetric” to mean that the region R measured in this coordinate 
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system is a sphere. Adopting this convention the particular case which we 
study further in this paper is that where a linear function is fitted over a 
spherical region R and the true model involving p=3k(k+1) extra constants 
is in fact quadratic over the whole of the operability region O. 

4.2 Requirement (a); True Model Quadratic and Region R Spherical. We as- 
sume that the “center of gravity” of the design points is at the center of the 
region R, defined by 


where 


Now for such a region 


R 


+> *) r(* + *) 
2 2 2 

k 

DX (6: + 1) 

i=l +1 


2 


unless any 4; is odd, when the value of the integral is zero. 
Hence 


1} 


f = 
R 


and with x2’ now the vector of quadratic variables 


0 | (k + 2)—'h 


je | 0 
of = (k + 2)-'| 
0 | 0 


+ jeje’ | 0 
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where j,’=[1 1---1] and J, is the unit matrix of order k, 


‘ 

= 

wee k 

1 

z. 
i 

(49) 
S; 
in 
(60) 
(52) 

(53) 


636 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1959 


4.2a Vanishing of the Third Order Moments. Now with X2 a matrix of quad- 
ratic variables, the alias matrix A appearing in (48) can be partitioned after 


the first row as follows: 
A= | | (54) 
= 


where Ai=(cu, +++, Ck} Ci, Cane) is a 1Xp row vector of second 
moments, with 


N 
cy = > ju 
u=1 


and A; is a kXp matrix of linear combinations of third moments only. Using 
(50) to obtain the values of the integrals in the equation (47) for V and sub- 
stituting (51), (52), (53) and (54) in the equation (48) for B, the elements of 
the integral J = V+B for this case of a plane fitted to a truly quadratic surface 
are found to be 


V =1+ + 2) (55) 


i=l 


k k k k 2 
2k +2) > 2( aw) 
+ t=] jei+l i=l (56) 
(k + 2)*(k + 4) 
where 
ij 
= and the aj = 
(o/-/N) 


meagure the size of the second order constants §;; relative to the sampling error. 
Now third moments [hij] enter J only in the first term of B and from the form 
of this term we see that, if {con} is non-singular, J is minimized with respect 
to the [hij] for general a;; and whatever the c”, if all the [hij] vanish. We 
proceed therefore in what follows by assuming that all third moments have 
these optimal zero values. 

With third moments all zero, the expression J contains only the elements of 
a, and the second moments c¢;; of the design. If the z’s were random variables 
these c;; would be the variances and covariances of the k-variate distribution 
of the z’s and they will be so called. Now, whereas V does not contain a, the 
integrated bias B is a quadratic form in the elements of a. In general then, if 
contributions from bias and variance are considered simultaneously, values of 
ci; Which minimize J will depend on the elements of a, that is they will depend 
on the size of the quadratic parameters §;; relative to the sampling error. Before 
studying this general case further we first consider the extreme cases of com- 
pletely dominant variance and completely dominant bias. 


Aly 
: 
pred 
hae 
! 
| 


RESPONSE SURFACE DESIGN 637 


4.2b Case of Completely Dominant Variance. In the case of completely domi- 
nant variance, the model is assumed to fit perfectly and there is no contribution 
from the component B. The integral J is minimized by minimizing V. In 
general, designs chosen to minimize V will extend to the limits of the oper- 
ability region O. 

For example suppose the size of the design is limited by limiting the variances 
of the k-variate distribution of points so that 


t=1,---,k (57) 


with y; suitably chosen constants. It is shown [13, 16, 19, 2] that V is minimized 
when 
Ci 


) 
58 
Cij 0 all 1, J J ( ) 


Thus in these circumstances we should use a first order orthogonal design, that 
is a design which is such that the column vectors of the design matrix have zero 
inner products one with another and with a vector of ones. This orthogonal 
design would be chosen to have largest possible values for the “variances” of 
the 2’s. 

4.2c Case of Completely Dominant Bias. In the other extreme case of com- 
pletely dominant bias, where the model is not assumed to fit perfectly and the 
variance contribution approaches zero, J is minimized by minimizing B. From 
inspection of equation (56) with the [hij]=0, or from the general result of 
Appendix 1, the minimizing values for the c,; are seen to be 


(59) 


cx = ec = (kK + 2)" 
cy = 0 all 


Thus to minimize bias alone over the spherical region R, the design must be 
orthogonal with third moments all zero and the “variances” of the z’s all equal 
to (k+2)-!. What is implied concerning the size of the design can be seen as 


follows. If 
k 1/2 
( 


is the distance of the u-th design point from the center of the region R, then for 
the optimal design 
k N k 


N 


k+2 + 2 i=l u=1 i=l u=1 


where # is the root mean square distance of the experimental points from the 
origin and so equals \/c if k=1. Thus for minimal bias * the root mean n square 
distance of the experimental points from the center of R must be V/k/(k+2) 
times the radius of R. 

It is interesting to see that the orthogonality condition arises here as a result 
of minimizing bias alone over a spherical region. In previous investigations the 
conclusion that an orthogonal design was optimal has usually been arrived at 
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by minimizing only variance. The result concerning minimization of bias alone 
is of some practical interest in certain problems of numerical analysis [4, 3] 
where there are no sampling errors but planarity assumptions may not be com- 
pletely justified. 

4.2d Intermediate Case of Contributions from Both Variance and Bias. In 
practice usually both variance and bias occur simultaneously. Now for known 
a;; the optimum values of the c;; could readily be found; usually however, the 
a,; are unknown. In these circumstances we can make some progress by pro- 
ceeding in the following way. In practice we usually do not know the nature 
of the response surface with which we are dealing; in particular we do not know 
the orientation of the response surface with respect to the design. We proceed 
therefore by taking the average value of / over all orthogonal rotations of the 
response surface denoting the average thus obtained by J. Since V does not 
contain any of the constants of the response function, such a process leaves 
the value of (55) unchanged. The quantity B however is a quadratic form in 
the elements of the vector a: or equivalently of the vector 62. The average value 
of B over all rotations of the surface is obtained by substituting the averaged 
values over all rotations of the products a,,a;; which occur in (56). The details 
of the averaging are given in Appendix 2. Using these averaged values, and 
with the optimal value of zero substituted for all the third moments we obtain 


J=V+B 
k k 1 2 


i=l i=1 k + 2 


k k 1 1 k 
+b> (« - + 2(a — b) 


t=1 j=l k + 2 t=] j=l 
+ k(k + 2)-*(k + 4)“ [kik + 3)a — (k — I(k + 4)d], (61) 
where 
Now V is minimized with respect to the c,;; ({#7) when C=N-"(X'X) is 
diagonal, i.e. when 
c; = 0. 
But these are also the values of ¢;; (i+) which minimize B; hence minimization 


of J for variations in ¢;; ({47), irrespective of the values of the c,;, is achieved 
when 


cy = 0. (62) 

This implies that 
= (63) 
After substituting (62) and (63) in (61), differentiating with respect to the 


cy (i=1, 2, --- , k) and equating the result to zero, we obtain k equations the 
i-th one of which is 
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1 


j=l 
Subtracting the g-th such equation from the 7-th then gives 
(k + gg? (Cis? — + — Cog) = O (65) 
which, since the c;; are positive, implies that 
Ci = Cog =e = 1,2,---, k) (66) 


We are thus led to the conclusion that the criterion J averaged for all orienta- 
tions of the response surface is minimized when the design is a first order orthog- 
onal design with all third moments zero and with the variances of the 2; all 
equal. 

4.2c Summary of Results. We have shown then that if the surface is truly 
quadratic over the operability region O and a linear model is fitted over a 
smaller spherical region within O defined by >o27,2<1: 


(i) V+B is minimized, for all 8,;; and c;;, when the third moments [ijk] of 
the design are chosen to be zero. 

(ii) V alone is minimized, for O defined by c;;<~y;, when the design is chosen 
to be first order orthogonal with c;;=;, that is with the design of maxi- 
mum possible size. 

(iii) B alone is minimized when the design is chosen to be first order orthog- 
onal with all third order moments zero and ¢;;=1/(k+2). 

(iv) V+B is minimized if values of 8,;; averaged over all rotations are sub- 
stituted, when the design has all third order moments zero and is first 
order orthogonal with the c,; all equal. 


These conclusions suggest that we should accept that the best design to use 
in the circumstances implied by our assumptions is first order orthogonal with 
the c;; equal and with all third order moments zero, that is such that 


Ci =¢;;=c (alliand); 
lijk] = 0. 


Adopting the terminology used before [8] we shall call designs of this class first 
order orthogonal designs of type B. 

4.3 Choice of Optimal Size for First Order Orthogonal Design of Type B. The 
question remains of the optimal size of the first order orthogonal type B design. 
This involves the choice of c or equivalently of *=/ke, the root mean square 
distance of the experimental points from the centre of the design. We have 
seen that for minimization of V alone the size of the design should be as large 
as possible whereas for minimization of B alone ¢ the root mean square distance 
of the experimental points from the center of R should be Vk/(k+2), that is 
smaller than the radius of R. We need to reach some compromise in the practical 
situation where neither the contribution from variance nor that from bias can 
be ignored. 
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The variance and bias functions for any first order orthogonal design of type 
B in the general case where no assumptions are made concerning the nature of 
the 8;; may be found by setting e;;=c,;;=c (all i and j); ¢;;=0 (¢#j) and 
[ijk] =O (all 7 and J) in (55) and (56). We find 


V=1+—— (68) 


2k +2) doa? 2( asi) 
i<j 


(6 
(k + 2)?(k + 4) 


Now consider a symmetric kX k matrix @ for which the 7-th diagonal element 
is a,;; and for which the element occupying the intersection of the i-th row and 
j-th column is }a,; (i#j) and aj;=a;,. We have 


1 
tra = tr = > + ai;*. 
i 
Writing 
= tr{a?} = (tr a)*/tr{a?} (70) 


we have finally for any first order orthogonal design of type B 
J=V+B 


k 1 \?. 2%k+2-¢) 
{i+} + - )+ (71) 
(k + 2)e k+2 (k + 2)?(k + 4) 
Remembering that a;;=8;;\/V/c, the matrix @ is seen to be a simple multiple 
of a symmetric matrix whose diagonal elements are the quadratic coefficients 
8; and whose off-diagonal elements are one-half the interaction coefficients 8; 


of the true surface. Suppose the k latent roots of this latter matrix are 
Mi, Az, Ax. Then 


dA (72) 


ll 


j k l k k 
(tr w)?/tr{a?} = (Law? / \ 62 


ix} 
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Now the \,’s in the expression above are simply the coefficients in 
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the canonical form containing no product terms, into which the quadratic part 
of the true model 


k 
tol 
can be transformed by orthogonal rotation. These A; thus represent the quad- 
ratic effects of the canonical variable X; in the new coordinate system and their 
signs and magnitudes determine the characteristics of the true response surface. 
For example, if all the \; are negative, the surface has a maximum with ellips- 
oidal contours, the length of the 7-th principal axis being proportional to \;—". 
If one or more of the \; approaches zero, a line, plane or hyperplane of maxima 
results, while if certain of the \; are positive various minimax situations occur. 
The quantity @ is therefore seen to be an overall measure of the magnitude 
of the quadratic tendency of the surface relative to the sampling error. The 
quantity ¢ on the other hand is a homogeneous function of degree ‘zero in the 
\’s, being independent both of the sampling error and of the absolute magnitude 
of the \’s. It measures the “state of conditioning” (for example [5]) of the 
quadratic surface as evidenced by the variation among the )’s. In fact 


g =k/(1 + Cy’) (74) 


(A; — 4)? 
Cy X (75) 


is the coefficient of variation of the 2’s. 

If all the \’s were equal and of the same sign corresponding to the best state 
of conditioning of the surface (which would then have spherical contours) C) 
would be zero and ¢ would take its maximum value of k. At the other extreme, 
if the \’s were of mixed signs and }~\;=0, C, would be infinite and ¢ would 
take its minimum value of zero. If it so happened that p of the \’s were equal 
and of the same sign and the remainder were zero the value of ¢ would be equal 
to p. Thus in this case p would be a measure of the number of non-redundant 
canonical variables. The latter situation approximates to that found in prob- 
lems where the eventual objective is the location and exploration of maxiria. 
Here, the most common situation is that the true response surface is approxi- 
mated by a system in which a point, line, plane, or space of maxima oceur. For 
such examples p of the \’s would be negative and the remainder zero. 

The optimal value of c and hence of # = (ke)'/*, the root mean square distance 
of the experimental points from the center of the design, can of course be cal- 
culated for any given values of @ and ¢ by finding that value which minimizes 
J in (71). As we have seen however, the optimal value of 7 can take any value 
between « and V/i/(k+2) depending on the values chosen for 6 and ¢. To 
make further progress we need to determine what might be typical values for 
these constants and to do this we may proceed in a manner similar to that 
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adopted in the one-dimensional case. We have seen that the constant ¢ 
measures the state of conditioning of the response surface and values of par- 
ticular interest lie between the limits 1 <¢<k. Now for any fixed value of ¢ the 
integral J is of the form: 


J = V(c) + 6B(c) 


As before, by minimizing the product {V(c)}*B(c) we can choose c and hence 
* from these minimizing values so that V =gB, that is so that the value of @ 
makes the contribution V = V(c) from variance any desired multiple g of the 
contribution B=@B(c) from bias. The case g=1 is of particular interest since 
it is plausible that an experimenter attempting to graduate a function over 4 
particular limited region might make his choices of the size of region and degree 
of polynomial so that the average size of error arising from bias was about the 
same as that which he thought might arise from variance. 

Table 642 shows the value of # for g=k, k—1, - - - , 2, 1 and 0.2; for k=2, 3 
and 5, for the four cases. 

V_ Variance completely dominant 

V=4B Variance contribution four times that of bias 
V=B Variance and bias contributions equal 

B Bias completely dominant. 
Considering first those entries for which the variance and bias contributions 
are equal, it is seen once more that over the important range 1 <y<k the values 
for ¢ are very little larger than those obtained if the variance contribution is 
ignored entirely, and that furthermore, even if the variance contribution is 
made as large as four times the bias contribution, the value of 7 still remains 
comparatively small. 

As ¢ approaches the value 0 the optimal value of * becomes larger and even- 
tually goes to infinity. The reason for this apparent anomaly is easily seen. If 
@ is not infinite, a zero value for ¢ implies that }-\;=0. Now >= DOB 
is proportional to the bias in the estimate bo of the constant term 8. With the 
particular class of designs considered bp is the only biased estimate, all the other 
estimates b,, be, - - - , b, being completely unbiased by second order terms. It 
follows that for the particular case in which }>A;=0 and hence g=0, no second 


TABLE 642. OPTIMAL VALUE FOR # WITH ASSOCIATED VALUE OF /@ 


3 4 5 


| 


= 
: 
at 
= 
ve 
2 1.34 | 1.54] 1.00] 2.00] 0.84] 3.07 
0.99 | 4.73 | 0.79 | 6.23 | 0.74] 8.10 
0.71} © | 0.71 | 0.71 2 
0 0 0 © 0 
3 1.36 | 3.11 | 0.99] 4.80 | 0.87 | 6.56] 0.82] 8.75 
1.01 | 9.58 | 0.84 | 12.55 | 0.80 | 15.18 | 0.79 | 18.97 at 
0.77| |0.77| © |0.77| |0.77| 
| 0 0 0 0 0 0 
5 2.05 | 1.89 | 1.61] 1.87 | 1.33 | 2.59] 1.09 | 4.70] 0.97 | 7.32] 0.91 | 10.65 
1.30 | 9.02 | 1.01 | 12.15 | 0.93 | 14.57 | 0.89 | 17.04 | 0.87 | 20.20 | 0.86 | 25.13 2 
0.85| = |0.85| |085| © |0.85| |0.85| © |0.85| « 
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order bias exists in the estimate and consequently the optimal design (which 
is now that which minimizes variance alone) is of infinite size. This case )-\i=0 
is of course very atypical and we are left with the conclusion that in cases likely 
to be met in practice the optimal orthogonal design of type B should usually be 
chosen so that 7 is somewhat greater than /k/(k+2) but rather less than unity. 

4.4 Requirement (b); True Model Quadratic. In accordance with the plan out- 
lined at the beginning of the paper, we will choose from among that class of 
designs which satisfy requirement (a) that sub-class which best meets require- 
ment (b). It will be recalled that this latter requirement is designed to ensure 
that the experimental arrangement should allow a check to be made on the 
representational accuracy of the assumed class of polynomials, The require- 
ment is met by choosing the expected value of the residual sum of squares Sz 
to be large. Before going ahead with this plan we make a small digression to 
show that if, following the indications of requirement (a), we choose the first 
order design to be orthogonal, then requirement (b) independently implies that 
all third order moments should be zero. 

4.4a Independent Vanishing of Third Order Moments. Writing 


Xiu’ = (1, Viu, Teuy °° au) (77) 
we have from (9) 


E(Sr) — vo? = >> Be'(A’x1u — A — Xen’) Be 


= A’ X1uXiu' ABe 26,’ XouXiu + Be XouXou' Be (79) 


Now if the design is orthogonal N-! x1uxi/ = N-1(X,/X;) isa (k +1) X(k+1) 
diagonal matrix with first diagonal element unity and remaining diagonal ele- 
ments equal to c. The matrix 


Zz XouXin = | M;| 


is pX(k+1) and may be partitioned after the first column into two sub- 
matrices M, and M;, with M, a column vector containing p elements the first k 
of which are equal to c and the remainder to zero, and M; having each element 
a third order moment. Finally the elements of the matrix 


N-} XouXeu’ Xo = 


are all fourth order moments. Substitution in equation (79) and division by N 
defines a quantity 


F = N-{E(Sp) — vo?} = — — + (80) 
k 2 k 2 
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Third order moments are involved only in the second term whence it is seen 
that for any first order orthogonal design, F is maximized with respect to third 
order moments for all values of 8,; when all these third order moments are zero. 

44b Size of Fourth Moments. We now return to the main theme. From the 
infinite class of first order orthogonal designs which have all third order mo- 
ments zero and for which 7 is fixed at some specific value chosen on criterion 
(a), we have to choose a sub-class which makes large 


k 2 k k k k 

We notice that the only quantities which determine the design and which are 
at our choice are the fourth moments [ghij]. In general the optimal values of 
these moments depend upon the values of the elements of 82 and usually these 
elements are unknown. We can however obtain a design which is “good on the 
average” by arguing as befcre that, in a case where all orientations of the 
response surface with respect to the chosen coordinate system are equally 
likely, it is reasonable to average F over all rotations. Using equations (2.5), 
(2.6), (2.7), (2.18) and (2.19) from Appendix 2, we obtain 


N 


+ 2) 


k hk hk 
+ 2{ [iii] + — (83) 


t=] 


(84) 


k 1/2 
tf. = { 


i=l 


is the distance of the u-th experimental point from the center of the region R 
and 


N 1/2 


u=1 


is the root mean square distance. 
The above expression can be written 


2 
2 k k 


where G(r) is a fourth moment coefficient 


of the distances ry. 
Also we can write 
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where C2 is the coefficient of variation of the r.2. Cz takes its minimal value 
of zero and G(r) its minimal value of unity when all the experimental points 
are equidistant from the center of R. We can see therefore from (85) that the 
detectability of quadratic discrepancy from the assumed planar model is in- 
creased as the distances of the experimental points from the center of the 
region R have greater and greater “percentage variation.” 

With this in mind we can rewrite the expression in the form 


Now, since 0<¢<k, both terms within tne bracket are necessarily non- 
negative. For designs in which the points are all equally spaced from the center 
the second term vanishes. For ¢ ven k and with # fixed by criterion (a) the size 
of o-?F then depends only «» «ae value of 6(k—¢) and in particular takes the 
value zero when g=k. To see the reason for this we note that when g=k the 
contours of the response surface are spheres and the response function is repre- 
sented by a second degree expression containing quadratic terms only and no 
interaction terms. Now if all points of the design are equally spaced from thx: 
center it is easily shown that F' in equation (82) and hence the expected value 
of the residual sum of squares contains only interaction terms. Second order 
effects of a purely quadratic nature are therefore undetectable. 

As ¢ becomes smaller we encounter surfaces which cannot (except in particu- 
lar orientations) be represented by quadratic terms alone and consequently 
the power to detect interaction terms becomes on the average more and more 
valuable to us. When the points are not all equally spread from the center, F 
is non-zero even if =k, because unequal values for the r, render pure quadratic 
effects detectable. 


5. SOME DESIGNS WHICH MEET THE REQUIREMENTS 


We now consider one particular way of generating first order designs which 
satisfy requirements (a) and (b) and which, in accordance with requirement 
(c), do not contain an excessively large number of points. 

5.1 Requirement (a). Designs satisfying requirement (a) must be first order 
orthogonal with third order moments zero. One simply class of designs of this 
sort can be generated as follows. Suppose k factors are to be investigated in N 
trials where N is even and k<4N. We first write down any (}N Xk) matrix 
Z,= {2,;} having orthogonal columns so that 

N/2 
u=l 


N/2 


> zu? = 


the value of c being chosen on criterion (a). We now take for our design the 
NXk matrix 


(89) 
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This resulting arrangement satisfies the requirements for a first order ortho- 
gonal design and (compare [8, Appendix 2]) the third moments of the design 
are all zero because the NX4$k(k+1) matrix Z, of independent variables 


2, 
and consequently the matrix of third order moments is null since 
2, 


Now [2], if the levels of the factors are completely unrestricted and at our 
choice and k <4N, the original k vectors in the matrix Z, may be chosen to be 
orthogonal to a set of up to $N —k further vectors which correspond to extrane- 
ous systematic effects which it is desired to eliminate. These further vectors 
may for example represent block contrasts or they may be orthogonal poly- 
nomials corresponding to possible time trends. Finally, “spherical randomisa- 
tion” [2] may be employed to ensure that normal theory may be validly applied 
in subsequent statistical analysis. The design D obtained by replicating Z, with 
reversed signs will preserve all these properties and will possess the additional 
one (arising from the property that all third order moments are zero) that no 
estimate of a first order effect is biased by a term of second order, that is by 
a quadratic or interaction term. 

With k equal to 4N we obtain the designs of this kind which allow the in- 
vestigation of the maximum number of factors. These particular designs are 
such that the distances of the experimental points from the origin are all equal, 
for since Z; is a square orthogonal matrix 


tml 


Of particular interest are the designs of ‘his kind which employ only two levels. 
It was shown by Plackett and Burman [16] that a square mXm orthogonal 
matrix whose first column consisted of +1’s, and whose remaining columns 
contained an equal number of +1’s and —1’s, could be formed for m any 
multiple of 4. Using such a design for the matrix Z,, first order designs of type 
B suitable for testing up to four factors in eight trials, eight factors in 16 trials, 
twelve factors in 24 trials, ete., can be generated by replicating the basic design 
with reversed signs. 

When m is a power of two, the original Plackett and Burman design and the 
design of type B derived by replicating it with reversed signs are each frac- 
tionally replicated 2* factorials. The requirement that all third order moments 
are zero is equivalent to choosing the fractional replicates so that no three- 
factor interaction is included in the alias sub-group. That is to say, so that no 
two factor interaction is confounded with a main effect. Two level fractional 
factorials of this kind, thus provide a particular class of designs which satisfy 
requirement (a). These designs were derived some time ago [17, 18, and 8] 
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and have been extensively used (for example [15]). They can readily be genera- 
ated in the manner illustrated in the example which follows. 

Suppose we require a design to test eight factors in 16 experiments. We first 
generate an 8X8 orthogonal matrix whose first column, which we associate 
with the variable x;, consists of +1’s and whose remaining seven columns, associ- 
ated with the variables x2, 23, - - - , 2s, consist of equal numbers of —1’s and 
+1’s. These last seven columns can be obtained by writing out as column 
vectors the contrasts associated with three factors x2, x3, 24 run in a 2° factorial 
design together with their interactions. These interaction contrasts are then 
associated with the remaining factors, for example 25= 2203, = TsXa, 
= We thus obtain for the matrix 


8 
= 


-1 -1 
1-1 -1 
—1 
1 
1 


1 


— 


1 
1 -1l 
1 


The first order design of type B is then obtained by replicating this matrix Z; 
with all signs reversed. 

5.2 Requirement (b). We now need to try to satisfy the requirement that 
departures from the assumed model, which occur because the true function is 
quadratic rather than linear, should be readily detectable. 

The experimental designs we have discussed, in which k factors are tested 
in 2k trials by replicating a square k Xk orthogonal matrix with reversed signs, 
necessarily have points all equispaced from the origin. It should be noted how- 
ever that even if the points of the design are all equispaced from the origin the 
expected value of the residual sum of squares although minimal is not neces- 
sarily small except for special types of response surfaces. The arrangements 
considered do in fact provide quite sensitive tests for all but the rather excep- 
tional kinds of departure from linearity in which the true response functions 
are described in the original coordinate system by quadratic constants 8;; 
alone. These particular kinds of departures from linearity only become detect- 
able when Cz, the coefficient of variation of the squared radial distances of 
experimental points, is non-zero. If the basic design is to be modified by the 
addition of extra points, the greatest increase in C, can be achieved by 
adding no extra points at the origin. As an example, consider the n= 16 point 
factorial design of type B already discussed in Section 5.1. With no additional 
points at the center we obtain a design containing N =n+m points in all. 

Using this design the least squares estimates of the coefficients in the linear 
model have expected values which, on the assumption that the model is truly 
quadratic, are of the form 


1 1 1 
1 
1-1 -1 1 
1 
1 1 1 | 
‘ 
| 
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E(bo) = Bo + ¢ >, Bis 


E(b;) = B; = 1,2,---, 8) 


The expected value of the residual sum of squares which has 7+ degrees of 
freedom is 


N N? 
E(Sr) = (B11 + Bos + + Bes)? + { (B12 + Bss + Bas + Bs)? 


+ (Bis + Bes + Bar + Bes)? + (B14 + Bos + B37 + Bs)? 
+ (Bis + Bos + Bez + Bas)? + (Bis + Bas + Bss + Bsz)? 
+ (B17 + Bos + Bas + Bos)? + (Bis + Bar + Bas + Bas)?} + (7 + no)o?. 


The residual sum of squares Sz based on (7+) degrees of freedom can be 
divided into two parts, the first part containing 7+1=8 degrees of freedom 
associated exclusively with terms measuring components due to possible lack 
of fit and the second part S, containing m»—1 degrees of freedom associated 
exclusively with a measure of pure error obtained from the variation of the ob- 
servations recorded at the center point. The sum of squares for lack of fit 
having 8 degrees of freedom can be further sub-divided into 8 separate com- 
ponents S,, Se, S3, - - - , Sseach associated with a single degree of freedom. If we 
denote the observations at the center conditions by - , You, * * Yonoy 
their average by j, and the average of the n = 16 observations in the fractional 
factorial design by ja, we can write a detailed analysis of this residual sum of 
squares as follows. 

The seven sums of squares S2, S;, - - - , Ss correspond to the seven contrasts 
within each of which four interaction terms are confounded (for example 
[8, p. 14]). Even if no center points were added, so that S, did not appear, 
a fairly powerful test of the assumption of linearity would usually be possible, 
provided 

(i) an independent estimate of co? was available; 

(ii) the contours of the true response surface in the eight-dimensional space 

of the variables were not too nearly spherical; 

(iii) the “canonical axes” of the true system were not oriented too closely 

parallel to the coordinate axes of the variables; 

(iv) the interaction terms were not such that the estimated linear functions 

of them were all zero. 
Conditions (ii) and (iii) are essentially concerned with the possibility that the 
true surface can be described entirely by the quadratic terms 6,;, none of which 
appear in the expected value of the residual sum of squares when m=0. With 
m>O an additional term containing 


8 
Dd Bi 
i=l 


appears. As seen from the detailed analysis in Table 649, this is associated with 
the isolation of the contrast ja— jo, the difference between the average of the 
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ren: 
pete 
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TABLE 649. ANALYSIS OF RESIDUAL SUMS OF SQUARES 


Sums of Squares D/F Expected Values of Mean Squares 


no (=) (Ga — 50)? put’ +e 


N? 
+ Bss + Bas + Brs)* + 


2 N2 

(Bis + Bos + Bar + Bes)* + o* 

> + Bar + Bu + Bu)? + 0% 
8 aml 2 Bis 27 36 + Bas 


S. (You — 


ual 


center points and the average of the points in the fractional factorial design. 
The expected value of this difference is 


N 3 
> 
nN 


As no is increased above the value unity, the experimental arrangement be- 
comes more and more sensitive to discrepancies associated with the overall 
criterion of curvature 


5 
Bi. 
i=] 


In addition it becomes possible to isolate a sum of squares based on m—1 de- 
grees of freedom which, on the basic assumption of homogeneity of the error 
variance, measures pure error. This makes possible tests of departures from the 
linear model based solely on the internal evidence supplied by the design. 


6. DISCUSSION 


The class of designs to which we have been led, of which the fractional fac- 
torial discussed above is a member, seems to be excellently suited to the task 
at hand, indeed designs of exactly this type have for some time actually been 
applied in response surface studies. For the case of first order designs studied 
here the property of orthogonality which has repeatedly arisen is equivalent 
to that of “rotatability,” a concept introduced previously [6]. This agreement 
between the present theory and what has seemed desirable on an intuitive basis 
suggests that the present formulation of the problem is reasonably well con- 
ceived. By happy circumstance designs of the type discussed not only achieve 
the first three requirements of Section 1.2 (graduate the function as accurately 
as possible, allow check of representational accuracy of assumed form of poly- 
nomial, do not contain excessively large numbers of points) but also these 
arrangements ([6] for details) can form a nucleus upon which a satisfactory 
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design of order 2 can be built in case the assumed degree of polynomial proves 
inadequate, and they are ideally suited to become part of blocking arrange- 
ments in these larger designs. 

Much remains to be done and work is in progress on such topics as the effect 
of cubic bias in first order designs, the extension of the present ideas to second 
order designs, the effect of changing the criterion to the minimization of maxi- 
mum mean square error instead of minimization of average mean square error. 
The modifications necessary when a measure of the absolute value of the re- 
sponse is not important, but only the change in response from one point to 
another in the space of the variables, are also under consideration. 


APPENDIX 1 


A General Result Concerning Averaged Squared Bias. Suppose a polynomial 
of degree d, is fitted by the method of least squares over any region of interest 
R in the space of the variables when the true function is a polynomial of degree 
d,. Then B the squared bias averaged over the region R is minimized for all 
values of the coefficient of neglected terms, by making the moments of the 
design up to order d,+d, equal to the corresponding moments of a uniform 
distribution over the region R. 

We suppose that the fitted model is 


= (1.1) 
with b, the vector of least squares estimates, while the true model is 
n(x) = + (1.2) 
where 


Gi’ = (Bo; Bi, - +, Be; Bu, Bee; Bm, (1.4) 


contain all terms up to order d;, and x,’ and 6,’ are similar vectors containing 
all terms from order d,+1 up to order d2(d,:>d,). With observations at N points 
we can define the matrices X; and X; by 


Xu Xa’ 
X, = Xie. X = Xu’ (1.4) 
L mw’ L xen’ 


where the column vectors which make up X; are assumed to be linearly in- 
dependent. 
Now the component of the integral J arising from bias alone is B, where 


= af [E9(x) — n(x) = af Be’ (x:’A — — xe’) Bedx (1.6) 
= (1.7) 


where 


| 
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A = 
and 


A’wA — + vs 


Q f x1x;/dx 
R 

w= 2 f 
R 


us = 0 f X2Xo'dx. 
R 


Now write M,= X,'X; and so that 
A = 
Then 
A = (us — + — — 
+ we (1.14) 
= (us — we’ + (Mi M2 — ye)’ M2 — (1.15) 
=A:+ 4; (say) (1.16) 


Now it is shown below that A; is positive semi-definite as is y:. It follows that 
whatever the value of 62, 62’AG2 is minimized when M,~'M2=wy.~'w. and in 
particular when M,=y; and M;=e. Now the elements of M, and M; include 
all the moments of the design up to order d,+d, while the elements of y; and 
us include, in corresponding positions, all the moments of the region R up to 
order d,+d:. The stated result follows. 

To show that uw; and A, are positive semi-definite 


M2 


whence y; and 
Ui Ue 
are both positive semi-definite; but 


0 0 
(* ) )- (1.20) 
A; O ws — we’ 
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where 


I am 
T= ( — *) (1.21) 
. 


whence A; is positive semi-definite also. 


APPENDIX 2 


Rotational Average of the Constants in the Second Order Response Surface. To 
appreciate the character of the rotational average which is being taken, con- 
sider first the special case where k=2. We have for the quadratic part of the 
model ' 


Q = Buti? + Bats? + Bitte. (2.1) 
Now if we make an orthogonal rotation to new coordinates X, and X2 
= X, sin 6 + cos 6 
(2.2) 
= X, cos — sin 0 


then 
Q = {Bu sin? 6 + Bx: cos? 6 + Biz sin 6 cos o}X.? 
+ {Bu cos? 6 + Bx sin? 6 — By» sin 6 cos 0} X,.? (2.3) 
+ {2(8:1 — Bx) sin @ cos @ + Bi2(cos* @ — sin? 6)} XiX2 


The quatitities we are interested in are the values of the coefficients of X,’, 
X,? and X;X:2 averaged over values of @ from 0 to 2x. 

When k is greater om 2 we can deduce the form of the averages as follows. 
With 82’ = (B11, Bek, * » Be,x-1) the elements we wish to average 
are those of the aie 8282’. Now if the rotational averages of the elements are 
substituted in this matrix to give Bag.’ then x2 "FG x2 must be a function of 


k 
Lz 
inl 


only. 
This implies that 
Fa b 
b 
. 0 
b a 
2(a — b) 
0 ‘ 
= 
That is 
No~’8;;? = ai? = a (i = 1, 2, coe, k) (2.5) 


= aij? = 2a—b) G#j=1,2,---,k) 


ia 
(2.7) 
+ 
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with the remaining rotational averages zero. The quantities a and b are those 
substituted in equation (61). 

To determine the nature of these quantities consider again the symmetric 
matrix referred to in section 4.3 with diagonal elements §;;, off-diagonal ele- 
ments 38;,; and latent roots Ai, Az, - - - , Ax. We can express the a’s and b’s of 
equation (2.4) in terms of the )’s as follows. 

In all orthogonal rotations the trace of any power of the matrix of section 4.3 
remains constant. It follows in particular that 


i= Bis 
that is 


= > Bi; (2.9) 


i 


Now since (2.8) and (2.9) are true in every rotation, these equations are also 
true for the rotational averages. That is 


(XA)? = + he — (2.10) 
= + — (2.11) 
and, using (2.5), (2.6) and (2.7) 


= ka + k(k — 1)b 


No? > = ka + — — b) 


= (k + 1a — (k — 
Whence 


avd? /k(k + 2) 


+1) (Sd)? 2D a2} /(k — + 2). 


Whence, with 
N 


2 
and (Zn) / Da 


o 


+ 2) 
k(k + 2) 
+1)¢-2] 


(k — 1)k(k + 2) 


Also 
(2.13) 
2.15) | 
N 
2 
N 
(2.17) 
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REMARKS ON ZEROS AND TIES IN THE WILCOXON 
SIGNED RANK PROCEDURES* 


Joun W. Pratt 
Harvard University 


A Wilcoxon one-gample signed rank test may be made when some of 
the observations are 0 by dropping the 0’s before ranking. However, 
a sample can be not significantly positive while a more negative sample 
(obtained by decreasing each observation equally), is significantly posi- 
tive by the ordinary Wilcoxon test. The reverse is also possible. Two- 
piece confidence regions result. A procedure for avoiding these difficulties 
is proposed, namely to rank the observations including the 0’s, drop 
the ranks of the 0’s, and reject the null hypothesis if the sum of the 
remaining negative (or positive) ranks falls in the tail of its null dis- 
tribution (given the number of 0’s). If observations are tied in absolute 
value, their ranks may be averaged before attaching signs. This 
changes the null distribution. A sample may be significantly positive 
which is not significant if the observations are increased (unequally), 
or if the ties are broken in any way. 


1, THE REDUCED SAMPLE PROCEDURE FOR HANDLING 0’S AND ITS DISADVANTAGES 


UPPOSE one has decided to use the Wilcoxon signed rank test [9] to test the 
null hypothesis that a distribution is symmetric about 0, and one obtains 
this sample of 13 observations (arranged in order of absolute value): 


0, 2, 3, 4, 6, 7, 8, 9, 11, 14, 15, 17, —18. (1) 


The Wilcoxon test cannot be applied immediately, because of the 0. It is 
usually suggested following Wilcoxons advice [10], that the 0 be dropped before 
ranking, and that the Wilcoxon test be applied to the remaining observations. 
Hereafter this will be referred to as the reduced sample procedure. 

Dropping the 0 from sample (1) before ranking leaves 12 observations with 
signed ranks 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, —12 and a negative rank sum of 12. 
Under the null hypothesis, the probability of a negative rank sum of 11 or less 
in 12 observations is 55/2'? [7, Table 4]. The reduced sample procedure, then, 
would certainly judge sample (1) not significantly positive at any one-sided 
level a<55/2". 

One would now expect a sample of smaller observations to be judged not 
significantly positive (for these a). However, if the 0 is decreased (by an amount 
less than 2), or, indeed, if all observations are decreased (by an amount less 
than 1), the signed ranks of the new observations are 


—1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, —13, (2) 


and the negative rank sum is 14. Under the null hypothesis, the probability 
of a negative rank sum of 14 or less in 13 observations is 109/2'?, so the Wil- 
coxon test would certainly judge the new sample significantly positive at any 
one-sided level a>109/2". 


* This research was supported by the United States Navy through the Office of Naval Research, under con 
tract Number Nonr 1866(37). Reproduction in whole or in part is permitted for any purpose of the United States 
Government. 
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Thus, for 109/2%<a<55/2"=110/2" sample (1) would be judged not sig- 
nificantly positive but a more negafive sample would be judged significantly 
positive. 

The opposite anomaly can also occur. By the reduced sample procedure, the 
sample 


0, —1, —2, —3, —4, 5, 6, 7, 8, 9, 10, 11 (3) 


would certainly be judged significantly positive at any one-sided level a> 43/2". 
However, if 0, or all the observations, were increased slightly, the signed ranks 
of the new observations would be 


1, ~2, ~3, —4, —&, 6, 7, 8, 9, 20, 11, 12, (4) 


and the Wilcoxon test would certainly judge the new sample not significantly 
positive at any one-sided level a<87/2". Thus, for 43/2" =86/2"<a<87/2", 
sample (3) would be judged significantly positive but a more positive sample 
would be judged not significantly positive. 

This peculiarity of the reduced sample test procedure leads to an unpleasant- 
ness in the corresponding confidence procedure, which is obtained as follows. 
Assume the population from which the observations are sampled is symmetric 
about an unknown center of symmetry yu. Suppose one has originally a test of 
the null hypothesis 1.=0. One can test the hypothesis y=» by subtracting po 
from every observation and applying the original test. The values uo which 
would be accepted when so tested form a confidence region for the unknown 
center of symmetry u. 

If the original test is the one-sided Wilcoxon test rejecting the null hypothesis 
when the negative rank sum is k or less, the region is an interval extending to 
+ whose left end-point is the k+1-st from the smallest (algebraically) of the 
averages 3(X,+X,), i<j, the X; being the observations. This fact was noted 
by Tukey [7], and there is a convenient graphical procedure based on it 
[8, p. 446]. The two-sided Wilcoxon test leads to a similarly obtained finite 
right end-point as well. 

Strictly speaking, however, this is not the confidence procedure correspond- 
ing to the reduced sample procedure for handling 0’s. For sample (1), for in- 
stance, at a one-sided level a between 109/2" and 110/2", the interval of the 
last paragraph would have a positive left end-point, namely 1; yet, by the re- 
duced sample procedure, the hypothesis 1=0 was not rejected. Thus, in this 
instance, the confidence region would have two separate parts; it would con, 
sist of the point 0 together with an interval on the positive axis having left end- 
point 1. Similarly, for sample (3), at a one-sided level a between 86/2" and 
87/2", the interval of the last paragraph would have a negative left end-point, 
yet the reduced sample procedure rejected the hypothesis n =0; the confidence 
region would consist of an interval with an interior point (0) removed. (These 
peculiar confidence regions do not correspond to the reduced sample procedure 
for handling 0’s alone, but to this procedure taken together with the ordinary 
Wilcoxon test for samples without 0’s. The reduced sample procedure for 
handling 0’s does not, in itself, lead to a confidence procedure, since subtracting 
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uo from all observations will yield a sample without 0’s unless yo has one of a few 
isolated values.) 

One might also argue (purely heuristically) as follows. The unsigned ranks 
(the ranks of the absolute values of the observations) are not in doubt in the 
above cases. For sample (1), for instance, they are clearly 1 through 13 re- 
spectively. There is, to be sure, a question what sign to attach to the rank of 0, 
but this is not a sufficient reason to change the ranks of the other observations. 
And sometimes it makes no difference what sign is attached to the rank of 0. 
Thus, for a>109/2", sample (1) is significantly positive by the Wilcoxon 
criterion even if 0 is given signed rank —1, so sample (1) should be judged 
significantly positive. Similarly, for a<87/2", sample (3) is not significantly 
positive by the Wilcoxon criterion even if 0 is given signed rank +1, so sample 
(3) should be judged not significantly positive. We see that the reduced sample 
procedure sometimes gives a contrary conclusion. 

If the ordinary Wilcoxon test is to be used when there are no 0’s, the following 
three requirements on a procedure to be used when there are 0’s are easily seen 
to be equivalent, each seems intuitively desirable, and the above examples show 
that none is satisfied by the reduced sample procedure. 

(i) Increasing the observed values shall not make a significantly positive 
sample insignificant nor an insignificant sample significantly negative. 

(ii) Assuming that the distribution of the observations has a center of sym- 

metry u, those values of u which are not rejected shall form an interval. 

(iii) A sample shall be judged significantly positive if, when the 0’s are in- 

cluded in the ranking, the sample is significantly positive whatever 
signs are attached to the ranks of the 0’s; similarly for significantly 
negative and not significant. 

The examples above may appear more striking when it is noted that each 
displays two samples such that the more positive sample S has larger probabil- 
ity in the lower tail than the more negative sample S’, even though the observed 
value of the negative rank sum is excluded from the tail for S and included for 
S’. More precisely, S is a more positive sample than S’, yet the null probability 
of a negative rank sum smaller than that of S is larger than the null probability 
of a negative rank sum smaller than or equal to that of S’, if rank sums are 
computed by dropping 0’s before ranking. 

This property of the examples made it unnecessary to discuss what to do in 
the borderline situation when the probability in the tail is greater or less than 
a depending on whether the observed value of the negative rank sum is included 
in the tail or not; that is, when the null probability of a negative rank sum less 
than that observed is less than a but the null probability of a negative rank 
sum less than or equal to that observed is greater than a. 

Published tables [10, Table II, for instance] choose the critical value which 
makes the true significance level as near a as possible. A “conservative” pro- 
cedure would be to use the critical value which makes the true significance 
level as near a@ as possible but no larger than a. By either of these procedures, 
the anomalies occur for samples (1) and (3) for much wider ranges of a than 
were specified, and for many other samples, including some of smaller sample 
size. 
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In general, the following is a convenient way to express the anomaly, at 
least from a theoretical point of view. Let ¢,(S), ¢0(S), and ¢_(S) be the prob- 
abilities, under a test based on the signed rank sum and randomized to have 
level a exactly, of calling sample S significantly positive, insignificant, and 
significantly negative respectively. Then dropping the 0’s before ranking fails 
to satisfy the condition: 

(iv) #4 shall be a nondecreasing function of the observations, and ¢_ non- 

increasing. 
It is easy to see that this corresponds to (i) above. To show that it corresponds 
to (ii) as well would require more discussion of randomized confidence intervals 
than would be justified here. It is also equivalent to the randomized statement 
corresponding to (iii). 


2. A PROCEDURE FOR HANDLING 0’8 WHICH SATISFIES THE 
INTUITIVE REQUIREMENTS 


Suppose it is agreed to use the Wilcoxon test when there are no 0’s and to 
satisfy requirements (i)—(iv). Then the only question remaining is what to do 
if the decision given by the Wilcoxon criterion depends on the signs attached 
to the ranks of the 0’s, or equivalently if 0 is an end-point of the confidence 
interval for the assumed center of symmetry. Sometimes it might be enough 
just to say that this is the case. Tukey [7, p. 17] expresses this point of view 
as follows. 

“‘As usual, all difficulties can be resolved by saying that: 

(a) the interval with its endpoints has at least the indicated probability of 

covering the population center of symmetry, 

(b) the interval without endpoints has at most the indicated probability of 

covering the population center of symmetry.” 

If a test is required, however, the following procedure seems reasonable. Rank 
all the observations, including the 0’s, from smallest to largest in (absolute) 
magnitude; drop the ranks of the 0’s from consideration without changing the 
remaining ranks; compute the negative rank sum for these remaining ranks, 
that is the sum of the ranks corresponding to negative observations; and reject 
the null hypothesis if the observed negative rank sum falls in the lower tail of 
the distribution of all its possible values under random choice of signs. 

The test just described is, of course, one-sided. Substituting “positive” for 
“negative” gives another one-sided test. Rejecting the null hypothesis if either 
one-sided test rejects it gives a two-sided test (at double the significance level). 

For example, cons'der the sample (arranged in order of absolute value): 


0, 0, —2, —3, —5, 6, 9, 11, 12, 15, 16. (5) 

The signed ranks of the non-0 observations are 
—3, —4, —5, 6, 7, 8, 9, 10, 11, (6) 
and the negative rank sum is 3+4+5=12. There are 2°=512 possible assign- 
ments of signs to these ranks, of which 19 give a negative rank sum less than 


12. Therefore, for these ranks, the null probability (the probability under the 
null hypothesis), of a negative rank sum less than 12 is 19/512=.037. Thus, 
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at a one-sided level, a=2.5%, for inste ace, the null hypothesis would not be 
rejected. 

The reduced sample procedure, on the other hand, would drop the 0’s before 
ranking and consider the signed ranks 


=I, a, —3, 4, 5, 6, 7, 8, 9, (7) 


with a negative rank sum of 6. For these ranks, the null probability of a nega- 
tive rank sum less than 6 is 10/512 = .020, and the null probability of a negative 
rank sum less than or equal to 6 is 14/512 =.027. Using published tables of the 
critical point [10, Table II, for instance], one would reject the null hypothesis 
at a one-sided level a=2.5%. (Actually, to achieve this level exactly, it would 
be necessary, having observed this sample, to reject with probability .7.) Note 
that, in contrast, the left end of the ordinary Wilcoxon confidence interval for 
the assumed center of symmetry yu is —1, illustrating again the disadvantage 
of the reduced sample procedure. 

The two procedures may be related to each other as follows. The reduced 
sample procedure orders the possible samples by the negative rank sum T” 
computed by dropping 0’s before ranking. The procedure proposed here orders 
by another negative rank sum 7’, computed by dropping the ranks of the 0’s 
after ranking. If there are z 0’s and m negative observations in the sample, 
T’’=T'+mz. There is only one sample with m=0, and it is most extreme by 
either ordering. The next two samples are also the same by both orderings. 
Samples with large m are not in the lower tail by either ordering. This indicates 
that the two orderings do not differ greatly in the lower tail, nor, similarly, in 
the upper tail. 

The proposed procedure does indeed satisfy (i)—(iv). To see that it satisfies 
(iii) [(i) and (ii) are equivalent to (iii), as mentioned earlier], note that the 
negative rank sums obtainable by attaching signs to the ranks of the 0’s are 
exactly the integers from 7’’ to T’’+k, inclusive, where k=z(z+1)/2. Let T 
be the negative rank sum obtained by attaching signs at random to the ranks 
of the 0’s. T’’<7<T"’+k always. The null distribution of T is exactly the 
null distribution of the negative rank sum when there are no 0’s or ties. Hence, 
if T’’ is above the rejection point of 7’, it is certainly above its own rejection 
point, and if 7’’+k is below the rejection point of 7, T’’ is certainly below its 
own rejection point. (iii) follows by noting again that the negative rank sums 
obtainable by attaching signs to the ranks of the 0’s are exactly 7T’’ to T’’ +k. 
(iv) also follows. 

The reduced sample procedure, as has been pointed out, does not satisfy 
(i)-(iv). It might be modified by invoking (i)—(iv) whenever doing so leads to 
a decision, and dropping the 0’s before ranking only when the significance of the 
sample by the Wilcoxon criterion would depend on the signs attached to the 
ranks of the 0’s. However, this is not really a natural procedure, and the modifi- 
cation nullifies the real point of the reduced sample procedure, that existing 
tables apply. Indeed, finding or setting the confidence level for this modified 
procedure would involve consideration of the distribution of T’’ and in addi- 
tion, the conditional distribution of 7’ given the value of T’’. The extra com- 
plication arises because of the composite, or two-step, character of the test. It 
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does not arise when only the ranks of the 0’s are dropped because then, as 
proved in the last paragraph, (i)—(iv) are satisfied automatically. 


3. NONZERO TIES 


If some observations are tied in absolute value, it is usually suggested that 
one average their ranks before attaching signs. Hereafter this will be referred to 
as the average rank procedure. By this procedure, for instance, the sample 


1, 1, 1, 1, 2, 3, —4, (8) 

would have signed ranks 
2.5, 2.5, 2.5, 2.5, 5, 6, —7, (9) 
and negative rank sum 7. (2.5 is the average of 1, 2, 3, 4, the ranks of the 


four 1’s.) 

Note, however, that the null distribution is not the same as if there were no 
ties, even if the tied observations happen to have the same sign. There are 27 
possible assignments of signs to the ranks (9), of which 14 give a negative rank 
sum of 7 or less. Therefore, for these ranks, the null probability of a negative 
rank sum of 7 or less is 14/2’, so the average rank procedure judges sample (8) 
significantly positive at the one-sided level a=14/2’. 

Condition (i) is not satisfied here, nor is the following counterpart of (iii). 

(iii) A sample shall be judged significantly positive if it is significantly posi- 

tive however the ties are broken; similarly for significantly negative and 
not significant. 

To see that the average rank procedure for handling ties does not satisfy (i) 
or (iii), note that if the 1’s of sample (8) are increased by different amounts 
(less than 1), or if the ties are broken in any way whatever, the signed ranks be- 
come 


1, 2, 3, 4, 5, 6, —7. (10) 


The negative rank sum is still 7. However, the ordinary Wilcoxon test would 
judge a sample with these ranks not significant at the one-sided level a= 14/2’ 
since the null probability of a negative rank sum less than 7 for the ranks (10) 
is 14/2’. [This was the null probability of a negative rank sum less than or 
equal to 7 for the ranks (9). | 

At the level a= 14/2’, then, the average rank procedure judges sample (8) 
significantly positive, yet if the observations are increased appropriately, (i), or 
if the ties are broken in any way whatever, (iii’), the Wilcoxon test judges the 
resulting sample not significant. 

Samples for which the average rank procedure violates (i) and (iii’) at more 
conventional significance levels are provided by adding observations larger than 
4 to sample (8). 

The confidence region obtained from sample (8) by applying the average 
rank procedure, at the one-sided level a= 14/2’, is the interval from +1 to 
+. Note that, though this is an interval, so (ii) is satisfied, it is different from 
the ordinary Wilcoxon confidence interval, obtained by the method outlined 
in the second paragraph after (4), which has left end-point — 1/2. 
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It would be valid, and indeed “conservative,” to judge a sample significant 
only if the ordinary Wilcoxon test judges it significant however the ties are 
broken, or to use the ordinary Wilcoxon confidence interval with its endpoints. 
(See (a) of Tukey’s remarks [7] quoted at the beginning of Section 2.) What 
would require further justification is the simultaneous use of this criterion and 
another, such as that of the average rank procedure for handling ties. We can- 
not, for instance, employ the average rank procedure samples with ties of op- 
posite sign, but test samples with ties all of the same sign, such as sample (8), by 
blithely breaking the ties on the grounds that the result will be the same how- 
ever we break them. This would invalidate the average rank procedure for 
handling samples with tied observations of opposite sign. For instance, the 
computation of the null distribution of the negative rank sum obtained by the 
average rank procedure in connection with the sample 


—i, ~1, 1,38, 4 (11) 


is based on the assumption that sample (8) has signed ranks (9), not (10). 

One might try to modify the computation of the null distribution to allow 
breaking ties when all have the same sign, but averaging the ranks as in the 
average rank procedure otherwise. However, it will no longer be true that all 
assignments of signs are equally likely. If sample (8) is given signed ranks (10), 
for instance, it appears signed ranks (10) have greater null probability than 


~~}, —32, 3, —4, 5, 6, 7, (12) 


since these will not be the signed ranks of a sample, such as (11), obtained by 
assigning new signs to sample (8). Even if the null distribution could be com- 
puted, it is not obvious (i), (ii), and (iii’) would hold, and neither the average 
rank procedure nor the ordinary Wilcoxon tables would be applicable. 

When few ties are likely, the “conservative” version of the Wilcoxon pro- 
cedure, outlined at the beginning of the paragraph containing (11), is presum- 
ably satisfactory. When many ties are likely, however, this procedure presum- 
ably has low power, since it amounts to breaking all ties in favor of the null 
hypothesis. In this situation, the average rank procedure now seems to me to 
have the edge. (Theoretically, at least, one must be very careful to what extent 
one looks at the sample before deciding what procedure to use.) The average 
rank procedure does not permit use ’ ‘ the ordinary Wilcoxon critical values and 
confidence interval, and if does not satisfy (i) and (iii). However, if any 0’s are 
handled as proposed in Section 2, it does satisfy (ii) and the following condition. 

(i’) Increasing the observed values equally shall not make a significantly posi- 
tive sample insignificant nor an insignificant sample significantly negative. 

(ii) is equivalent to (i’). We will prove (i’) is satisfied by tracing the contri- 
bution of a particular observation to the negative rank sum of a sample S, a 
sample S’ obtained by increasing all observations of S equally, and samples 
R(S) and R(S’) obtained by making the same reassignment of signs to S and 
S’. Let s and s’ be the signed ranks o: a particular observation X of S and the 
corresponding observation X’ of S’, where the signed ranks of each sample are 
defined by the average rank procedure and 0’s are handled as proposed in 
Section 2. We will show X and X’ do not contribute positively to N(S) 
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—N(R(S)) —[N(S’) —N(R(S’))], where N( ) denotes the signed rank sum of 
whatever sample is in the parentheses. 

X’>X. If X'>0>X, then s’>0>s and X and X’ contribute s+s— [s’Fs’] 
<0. (If R assigns a plus sign to X and X’, the upper signs apply; if minus, 
the lower.) If X’>X>0 or 0O>X’>X, either R preserves both X and X’ and 
they contribute s—s— [s’—s’]=0, or R replaces both by their negatives and 
they contribute 2s—2s’. If X’>X>0, since increasing all observations equally 
cannot decrease the rank of a positive observation, then s’>s>0 and 
2s— 2s’ <0. Similarly, if O>X’>X, then 0>s’>s and again 2s—2s’<0. 

This shows no observation contributes positively. Therefore N(S) —N(R(S)) 
— [N(S’) —N(R(S’))] <0. It follows N(S )>N(R(S’)) if N(S) >N(R(S)), and 
N(S’)>N(R(S’)) if N(S)>N(R(S)). Since the test is based on comparing the 
observed signed rank sum N(S) with the signed rank sums N(R(S)) obtainable 
by reassignment of signs, it follows that S’ is significantly positive if S is, and 
S is significantly negative if S’ is. 


4, FURTHER REMARKS 


4.1 In the absence of ties, the null probability of a negative rank sum of 
7 or less in 8 observations is 19/2*: the null probability of a negative rank sum 
of exactly 8 is 6/2°. Hence the ordinary Wilcoxon test, randomized to have one- 
sided level a=24/2* exactly, rejects the null hypothesis with probability 5/6 
when a negative rank sum of 8 is observed. 

Thus, if the ties are broken in any way whatever, the sample 


1, 1, 1, 2, 2, 2, 2, —3 


is significant with probability 5/6 by the ordinary Wilcoxon test at the one- 
sided level a=24/2*. By the average rank procedure, it is not significant at 
that level, since 24 of the 2° assignments of signs to the ranks 


2, 2, 2, 5.5, 5.5, 5.5, 5.5, —8 


give a negative rank sum less than 8. 

The left end-point of the confidence interval is —1/2 by the average rank 
procedure; by the ordinary Wilcoxon procedure it is +1 with probability 5/6, 
—1/2 with probability 1/6. 

This example complements that of Section 3. 

4.2 (i) and (ii) are equivalent for the situation discussed in Sections 1 and 
2, but not in general. (i’) and (ii) are equivalent whenever confidence intervals 
are obtained as described in Section 1. 

The method of the proof concluding Section 3 does not apply to (i), since 
s’>s may not hold when tied observations are increased unequally. On the 
other hand, it continues to apply to (i’) even if the ranks are replaced by scores, 
the scores of observations tied in absolute value are averaged before signs are 
attached, and 0’s are handled as proposed in Section 2, provided score is a non- 
decreasing, nonnegative function of rank. 

4.3 If there are no 0’s or ties, the signed rank sum, on which the Wilcoxon 
test is based, equals }>;<; sgn (X;+X,), where the X; are the observations and 
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- sgn (y) =+1 or —1 according as y>0 or y<0 [7]. (The method of determining 
confidence limits mentioned in the second paragraph after (4) follows from 
this fact.) If the convention is used that sgn (0) =0, >> :<; sgn (X:+X,) equals 
the signed rank sum computed by omitting the ranks of the 0’s as in the pro- 
cedure proposed in Section 2, and by averaging the ranks of observations tied 
in absolute value before attaching signs as in the average rank procedure of 
Section 3. The test based on this statistic can thus be seen (by an easy appli- 
cation of Tchebycheff’s inequality as done, for instance, by Lehmann [2, p. 169]), 
to be consistent against alternatives for which Pr[X;+X;>0] differs (in one 
direction if a one-sided test is used) from Pr(X;+X;<0] by at least a fixed 
amount for all 747. 

4.4 A commonly proposed method of handling ties in nonparametric tests 
is to average the values of the test statistic over the possible ways of breaking 
the ties. (We regard a 0 as a tie, and attaching a sign as breaking the tie.) 

For the Wilcoxon test, if there are 0’s , this method leads to dropping the 
ranks of the 0’s as proposed in Section 2, not to the reduced sample procedure 
of dropping the 0’s before ranking. If some observations are tied in absolute 
value, it leads to the average rank procedure of Section 3. 

4.5 Another commonly proposed method of handling ties in nonparametric 
tests is to break the ties at random. This reproduces the distribution under the 
null hypothesis of the test statistic in the case of no ties, but it introduces 
extraneous randomness which is unpleasant in itself and might reduce power. 
These objections are, of course, not conclusive. 

4.6 It can be argued that the outcome of an extraneous random experiment 
done to break ties is irrelevant to the matter under test, and hence that the 
inference made should be conditional on the outcome of this experiment; that 
is, the confidence level quoted should be the conditional probability, given the 
outcome of the extraneous random experiment, of rejecting the null hypothesis 
when it is true. (If we do otherwise, the argument might run, we are reporting 
an over-all level when we have enough information to know that, in the in- 
stance at hand, we are working at an entirely different level.) For instance, if 
all the ties happened to break “toward” (or “away from”) the null hypothesis, 
this argument would have us report that, because of the way the ties broke, 
we were in effect working at a stricter (or less strict) level than the nominal 
significance level. 

If this argument is accepted, it puts us back where we were before breaking 
the ties at random. 

A good explicit discussion of the general issue of conditional vs. uncondi- 
tional inferences has recently been given by Cox [1]. Sometimes the argument 
that inferences should be conditional is convincing, sometimes not. Sometimes 
a most powerful procedure is conditional, sometimes not. 

4.7 The following method of handling ties is, in a sense, the same as that 
of 4.5. 

Let ¢(S) be the probability of rejecting the null hypothesis when sample S 
is observed. In other words, ¢ is a randomized test. If ¢(S) is defined for all 
samples S without ties, define ¢(S’) for a sample S’ with ties as the average of 
¢(S) over all samples S arising from S’ by breaking the ties. (If these samples 
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S are not equally likely under the null hypothesis, the average will involve 
weights.) 

The method of Section 4.5 obtains this ¢(S’) by actually breaking the ties, 
and the argument of Section 4.6 against doing this moves me somewhat. 
Essentially the same argument could be applied to any randomized test, and 
in particular to that of the previous paragraph. I don’t find it compelling when 
so applied, perhaps because I don’t expect to carry out a randomized test, 
but merely to report ¢(S) for the sample observed. In this respect the test of 
the previous paragraph differs from that of Section 4.5, though the two tests 
have equal probability of rejecting the null hypothesis under all distributions. 

4.8 Paul Meier has suggested the following view of the situation. In the 
case of no 0’s or ties, order the possible samples of signed ranks according to 
their rank sum. The Wilcoxon test rejects the null hypothesis if the sample falls 
in one (or either) tail. If there is a 0, say just one, sometimes a finer measure- 
ment would distinguish whether it is positive or negative. However, with only 
our crude measurement, we do not know which of two adjoining samples of 
signed ranks we have. Thus the samples are lumped in pairs. It now seems 
reasonable to reject the pairs falling in one (cr either) tail, and this is just what 
the procedure proposed in Section 2 does. Changing the observations slightly 
will split the pairs up but not otherwise reorder them, so the peculiar confidence 
regions do not occur. The reduced sample procedure of Section 1, however, 
orders the pairs of samples differently from the unpaired samples. 

The same idea applies if there are more 0’s. The bunches of samples lumped 
together will be larger, but alike, and no new problem arises. 

If there are non-0 ties, however, the bunches are no longer alike, so that 
changing the observations slightly does not affect all bunches in the same way. 
Then a bunch may be significant by the average rank procedure of Section 3 
though it is entirely out of the tail (sample (8)). 

Figure 664 gives the upper tail of the ordering by signed rank sum (or the 
lower tail of the ordering by negative rank sum) of the possible samples of 
seven signed ranks. (1, 2, 3, 4, 5, 6, 7) is the sample of signed ranks which is 
most positive by this ordering, that is, it has the smallest negative rank sum. 
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It is denoted 0 in Figure 664 and, being most positive, appears farthest to the 
right. (—J, 2, 3, 4, 5, 6, 7) has the second smallest negative rank sum and is 
denoted 1. In general, ij - - - k denotes a sample with negative signed ranks 
—i, —i,-+-, —k and the remaining signed ranks positive. The bunches in- 
dicated arise when the smallest four observed absolute values are equal (and 
not 0) but there are no other ties. The sample —1, 1, 1, 1, —2, 3, 4, for example, 
upon finer measurement separating the first four observations, might have 
signed ranks (—1, 2, 3,4, —5, 6,7), (1, —2, 3,4, —5, 6,7), (1,2, —3, 4, — 5, 6, 7), 
or (1, 2, 3, —4, —5, 6, 7). These are accordingly in the same bunch; they are 
denoted 15, 25, 35, and 45. Sample (8), upon finer measurement, must have 
signed ranks (1, 2, 3, 4, 5, 6, —7). Accordingly, 7 is a bunch by itself. The 
anomaly discussed in Section 3 arose because there are 14 samples farther to 
the right than 7, but only 14 samples belong to bunches whose center is as far 
to the right as 7 or farther. 
4.9 <A referee makes the interesting remark that “the sample 


is significantly positive at the level 43/2", but if we keep sampling and come 
up with an additional positive observation }, then the sample is no longer sig- 
nificantly positive at that level [the new signed ranks being (4)]. The real 
culprit seems to be the arbitrary choice of rank sum (rather than any other 
statistic depending only on ordering) as the test statistic.” 

The ordinary ¢-test exhibits the same unpleasant property, however. In fact, 
by the é-test, sample (13) is significant at the (one-sided) level .0165, while the 
addition of a further positive observation, if it is small enough, gives a sample 
not significant at that level. 

In my opinion, this unpleasantness is not on a par with violations of (i)—(iv) 
of Section 1: in fact, in the framework of inference we are using, it is not sur- 
prising that a sufficiently small positive observation may provide more evidence 
of variability than positivity. 

4.10 It should be emphasized that almost nothing has been proved here 
about the “goodness” of any test. All that has been shown is that the test 
proposed in Section 2 satisfies intuitive conditions not satisfied by the reduced 
sample procedure of dropping 0’s before ranking. Comparisons could be made 
between procedures, but it would be hard at present to prove that the test of 
Section 2 has any optimum properties, since no such properties seem to have 
been found for the Wilcoxon test itself (except as a test of symmetry when the 
median is known to be 0 [3, p. 36]). 

Intuition to the contrary notwithstanding, it is not always true that larger 
observations are greater evidence of positivity in the population. For example, 
consider the density 


(.4, -l<2z<9Q, 
f(x) = 
0, otherwise. 


This is unquestionably the density of a “positive” population, since f(x) > f(—z) 
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for 0<2<1 and f(z)>f(—-2) for all x>0. Nevertheless, if a sample of 2 is 
drawn from this “positive” population, the signed ranks 1, —2 are more likely 
than —1, 2. I. R. Savage [6] gives further insight into this problem. Despite 
such examples, in the absence of information about the underlying distribution, 
conditions (i)—(iv) seem to me intuitively desirable. 

4.11 In his discussion of ties in nonparametric tests [5], Putter points out, 
“The usual practice ... has been to consider the conditional distribution of 
the statistics concerned given that the number of ties in each group is a fixed 
constant. This, however, was never explicitly made clear. . . . ” Accordingly, 
let. me now say explicitly that each test discussed—the {est proposed in Section 
2, the average rank test of Section 3, and the general method of handling ties 
(Section 4.4)—rejects values of its statistic falling in one (or either) tail of the 
conditional distribution of the statistic given the number of tied observations 
in each group (and the number of 0’s). 

For handling 0’s in the one-sided sign test, dropping the 0’s (that is, treating 
the sample as though the 0’s had not occurred) is the most powerful conditional 
procedure given the number of 0’s. However, dropping the 0’s, and averaging 
as in Section 4.4, lead to the same procedure in the case of the sign test, so 
this gives no clue to the relative merits of the reduced sample procedure of 
Section 1 and the procedure proposed in Section 2. 

Putter [5] shows that bre..':ing ties at random is less powerful than averaging 
(as in Seetion 4.4) for the one-sided sign test, and asymptotically less efficient 
for both the sign test and the two sample Wilcoxon (Mann-Whitney) test. 
These results suggest that, for the Wilcoxon signed rank test, breaking ties at 
random might be less powerful than the reduced sample procedure of Section 1, 
the procedure proposed in Section 2, and the average rank procedure of 
Section 3. 

Putter claims to prove that, in doing the one-sided sign test, dropping the 
0’s is the most powerful procedure “from the standpoint of the ties being 
random variables,” or unconditionally. However, his proof shows only that it 
is the most powerful conditional procedure. (It follows it is most powerful 
similar and therefore most powerful unbiased. A test is called “similar” if it 
has level exactly a for all distributions of the null hypothesis, and “unbiased” 
if its power is at least a for all alternative distributions.) This has now been 
mentioned elsewhere [4, footnote 6], but still seems worth mentioning here be- 
cause of the widespread misunderstanding of this point and a similar point 
about the “exact” test in 2X2 tables, also mentioned by Lehmann [4, foot- 
note 6]. 


REFERENCES 


[1] Cox, D. R., “Some problems connected with statistical inference,” Annals of Mathe- 
matical Statistics, 29 (1958), 357-72. 

[2] Lehmann, FE. L., “Consistency and unbiasedness of certain nonparametric tests,” 
Annals of Mathematical Statistics, 22 (1951), 165-79. 

[3] Lehmann, E. L., “The power of rank tests,” Annals of Mathematical Statistics, 24 
(1953), 23-43. 

[4] Lehmann, E. L., “Significance level and power,” Annals of Mathematical Statistice, 
29 (1958), 1167-76. 


| 
‘ 


ZEROS IN SIGNED RANK PROCEDURES 667 


[5] Putter, Joseph, “The treatment of ties in some nonparametric tests,” Annals of 
Mathematical Statistics, 26 (1955), 368-86. 

[6] Savage, I. R., “Contributions to the theory of rank order statistics—the one-sample 
case,” Annals of Mathematical Statistics, 30 (Dec. 1959). 

{7] Tukey, J. W., The Simplest Signed Rank Tests, Memorandum Report No. 17 (1949), 
Statistical Research Group, Princeton University. 

[8] Walker, Helen M., and Lev, Joseph, Statistical Inference, New York: Henry Holt, 
1953. 

[9] Wilcoxon, Frank, “Individual comparisons by ranking methods,” Biometrics, 1 
(1945), 80-3. 

[16] Wileoxon, Frank, Some Rapid Approximate Statistical Procedures, New York: Ameri- 

can Cyanamid Co., 1949. 


3 
| 
| 
‘ 


GRAPHIC METHODS BASED UPON PROPERTIES OF 
ADVANCING CENTROIDS* 


8. I. Asxovirz 
University of Pennsylvania and Albert Einstein Medical Center 


Short-cut techniques involving centroids of sets of points are ex- 
tended to least squares applications in line fitting and to determining 
the mean of a frequency distribution. 


ITHIN the past few years, a number of papers have appeared [1 to 7] in 
W wnia graphic methods were presented for carrying out various numerical 
determinations. These procedures consisted of a series of simple steps with 
pencil and ruler, involving no free-hand drawing or other subjective decision 
by the user. The basic concept underlying these techniques, that of the centroid 
of a set of points in a plane, has appeared elsewhere [3, 5] and need not be re- 
peated here. It is the purpose of the present article to demonstrate a few addi- 
tional techniques not published previously. 


1, LINE OF LEAST SQUARES 


In an article in this Journal some time ago [2], a method was described for 
drawing the best straight line through a series of points according to the least, 
squares criterion. The theorem upon which it depended could be proved directly 
from the normal equations for the line of least squares. What it implied in words 
was the following: If we assign to each of a finite series of points in a planea 
weight equal to its X-coordinate, and then locate the weighted centroid of these 
points, this center of gravity will always lie upon the least squares line of best 
fit for the given points. By proper selection of a temporary Y-axis, it was 
possible to simplify the weighting factors, and this led to a short graphic 
procedure. 

The previous examples refer only to data for which the X-coordinates were 
evenly spaced. However, the same principles are just as valid for unequal 
spacing of the points. When there are just three points, the method can be 
made quite short (Fig. 669). Given the three points A, B and C, project down- 
ward on to the X-axis, and mark on the edge of a card or ruler three dashes 
corresponding to the projection points Ao, By and Cy. Next, mark on the graph 
points A;, B, and C; directly above and below the original points alternately, 
so that AA,;= BoC», BB, =CoAo, and CC; = AoBo. Mark the point of intersection 
of A,B, with AB at 7; and the intersection of B,C, with BC at point U; then 
TU is the desired line. 

The proof follows readily from the following type of argument. Suppose that 
we had placed the Y-axis temporarily through the point A. Then applying the 
property of weighted centroids stated previously, the point A would acquire a 
weight of zero, and we would merely have to find the centroid of B with a 
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Heart Institute of the National Institutes of Health, and at the Ophthalmology Research Laboratory, Albert 
Einstein Medical Center, Northern Division, Philadelphia, under a research grant from the Weinstock Fund. 
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GRAPHIC METHODS 


weight of AoBo, and C with a weight of AoC». This is accomplished by dividing 
the segment AC into proportional parts, and the graphic procedure above has 
done just this. A simple check procedure is available from the fact that the 
intersection of AC and A;C; should lie on the line of best fit. This follows from 
an extension of Desargue’s theorem on projective triangles. 

For four points, a method only a little longer has been worked out. Two 
different methods have been developed for the general case, but they are too 
long to recommend in their present state. If an approximate solution is accept- 
able, one may replace the original data points by three representative centroids 
or “master points” [6] and then apply the graphic method above. 


AG, 


*B 
' 


Fia. 669. Drawing the least squares line 7'U for the three points A, B and C. 


It may be mentioned that the short-cut procedure published previously for 
evenly spaced z-values can be shortened even further. The observation was 
made that in proceeding to the right and to the left, according to the “two- 
thirds rule” [2], one may omit the point on either end and start advancing 
from the second. This is justified by taking a temporary axis through each 
extreme point, giving to it a weight of zero, instead of (as previously) taking 
an axis beyond the end-point and thereby giving the end-point a weight of one 
unit. An illustrative example will be given later (Fig. 672a). 

As a development of the above, a short graphic technique can be given for 
extending the line of least squares for an additional data point. For example, 
in Fig. 670, let us assume that the line TU represents the best fitting straight 
line for the points A to HZ. This may have been determined graphically or by 
the usual calculations. Now, let us assume that we desire to include the addi- 
tional point F, and would like to know the net effect upon the line of best fit. 
This is easily obtained by the following steps. 

Divide the horizontal extent covered by the original series of points into three 
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equal parts. In the diagram J and K trisect the segment AoE». Next, mark the 
points on the line of best fit (7U) above these trisection points, at P and Q. 
Join Q to the new point F and mark on QF the point R on a vertical line two- 
thirds of a spacing unit to the right of K. Then PR will be the best line for the 
set of points A through F. 

As a check, it would be well to join M, the mid-point of PQ, and F. Then 
MF should intersect PR at M’, exactly $s units to the right of M. Further- 
more, the Y-coordinate of M’ will be equal to the mean value for the entire 
series of points. The proof follows by the following line of reasoning: If we 
follow the two-thirds rule, starting with the point A, and apply it to the series 
A to E, we would arrive at a point two-thirds of the extent from A to E and 
on the line of best fit. This point must be none other than Q. Now, if we had 
applied the two-thirds rule to the series of points A through F, we would have 


By 


-F1a. 670. Given that TU is the line of least squares for points A, B, C, D, E, 
to work out the effect of an additiona! point F. 


arrived at Q as a turning point and then proceeded from Q to R in order to 
include the data point 7. Therefore, R must be a point on the line of best fit. 

To justify the presence of point P on the new line, let us consider the two- 
thirds procedure for the series of points Z to A, working from right to left and 
starting at the first point on the right, Z. This would lead exactly to the point 
P. Now, let us think of the six points F to A with the two-thirds procedure 
applied starting with the second point from the right, namely, E. Since the 
graphic steps would be precisely the same, we would arrive at the identical 
point P. Therefore, the graphic procedure is correct. 

It is interesting to note that the height of the point F does not alter the 
height of point P on the line of best fit. As the point F is lowered or raised, 
th- \-e VW would merely rotate about the pivot point P. 

“is procedure leads to an alternative method for constructing the line of 
best fit for a series of equally spaced points (Fig. 671a). In effect, we start with 
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‘ 


3 
Fig. 671a. Deriving the line of least squares by repeated addition of new points. 


the line AB as the best line for the points A and B, and then continue to extend 
the series of points one by one. Thus, we are led through the best fitting line 
L,- for the points A, B, C; Lea for the points A to D; and L,, for points A to E. 
We may examine at each step what effect the addition of a new point has had 
on the best fitting line for the antecedent points. 


2. MEAN VALUE FOR A FREQUENCY DISTRIBUTION 


Frequency diagrams are often published without any accompanying tabu- 
lation of numerical data, and sometimes even without a scale on the vertical 
axis. By applying the method of advancing centroids, it becomes possible to 
work out the mean value for the frequency distribution by an all-graphic ~ 
method. For example, starting with the histogram in Fig. 671b, let us take the 
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Fic. 671b. To find the mean value of a frequency distribution, given 
only a histogram, without any numerical data. 
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Y 


Fig. 672a. First part of graphic procedure to work out mean value: 
drawing a least squares line. 


mid-point of each bar and draw the line of best fit for these points, using the 
abbreviated method of starting with the second point in each direction and 
advancing by two-thirds of a spacing unit. If J and H are the centroids arrived 
at (Fig. 672a), we proceed through the following steps. Mark the projection 
points of the first and last points in the series on to the X-axis, at Ao and Gp 
(Fig. 672b). Join each of these points to the mid-point K of the segment JH. 
(Actually, K is the ordinary centroid of the whole set of points and its height 
represents the arithmetic mean of the y-values.) Draw a horizontal line from 
H, meeting AoK at M; and also JoP from J» to intersect GoK. Then M and P 
should be on the same vertical line, and this line projected downward on to 
the X-axis will provide the location of the mean value. 

A proof for the validity of this method may be seen as follows: Let us take 
a temporary Y-axis through the point A» and change the scale unit, if necessary, 


P 


N 


Fie. 672b. Final steps in graphic procedure. 
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so that the X-coordinates of the points now become 0, 1, 2, 3, - - - . It will be 
recalled [2] that the graphic procedure for locating H implies that H represents 
the weighted centroid of the points (z;, f;), with weights w;=2z;. Thus, we have 
tu = > > w; and yx = wi= Do The coordinates of K, 
the ordinary centroid, will be zx = and yx = )_y;/n. By similar triangles, 
we have zx/yx=xu/ym. Therefore, we may write the following equations: 


AoN = = 
yal xs/n)/( yi/n) 
=(Lefi/ 2) Ds 
= 


Transforming back to the XOY coordinate system will not alter the relation- 
ship of the point N to the series of points A, B, C, - - - , G. Thus, ON is the cor- 
rect mean value for the original frequency distribution. 
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OPTIMAL CONFIDENCE INTERVALS FOR THE VARIANCE 
OF A NORMAL DISTRIBUTION 


R. F. G. W. Kuerrt 
University of Washington 


Tables of divisors of the sample sums of squared deviations from the 
mean are provided to yield either the confidence interval of minimum 
length or the “shortest” unbiased interval for the variance of a normal 
distribution. Some questions are raised concerning confidence intervals 
of minimum length. 


1. INTRODUCTION 


HE notion of a confidence interval, introduced by Neyman, dates back to 

1930. Since that time one of the standard applications has been to the 
interval estimation of o”, the variance of a normal distribution. This problem 
is still worthy of attention due to the fact that the customary procedure for 
handling it, with the so called “equal tails” interval, is far from satisfactory. 
We shall present tables, prepared by high speed; computer, for an interval Jz, 
with a minimum length property, and Neyman’s “shortest” unbiased interval 
which will be denoted by Jsv. The latter was also tabulated by Rao [7, p. 223] 
for two confidence levels. In order to establish a basis for the short discussion 
following these tables, and to exhibit a unified computing approach, we will 
give a sketch for the derivation of the intervals used, for which we make no 
claims of originality. The calculations involved in the construction of the 
tables are sufficiently delicate that such procedures as ordinary linear inter- 
polation will not suffice. Our methods are described at the end of the paper. 


2. DERIVATIONS 
Let X;, Xs, - - - , Xw be a random sample of independent chance quantities, 
each having the distribution N(y, ¢?) with both » and o? unknown. The symbol 
I will denote a conndence interval (07(X1, X2, Xw), ¢7(Xi, Xo, --- , Xw), 
and «¢ the confidence coefficient. We will also need the chi-square density for n 
degrees of freedom: 


Sn(x) x> 0. 


(3) 
| — 
2 


Assume at the outset a restriction to intervals of the form! 
1 
(— - - x)'), (1) 


where n=N-—1. The discussion throughout the rest of the paper will center 
around confidence intervals of the form (1). The reader should realize, however, 


* Research sponsored by the Agnes Anderson Research Fund, University of Washington. 
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Laboratory, University of Washington. 
1 In all summations containing the index i it will run from 1 to N. 
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that it is only for convenience that we restrict ourselves to expressions con- 
taining >>(X;—X)*. The derivations and the tables apply equally well, for 
example, to any problem involving k independent samples from 2(u, o”). Just 
replace >>(X;—X)?, N, and n by 


k Nv k k 
> (xX. and DON, -k&, 


vel i=l 


respectively. 
A confidence interval of type (1) which is to have minimum length and con- 
fidence coefficient ¢ van be obtained by minimizing 


1 1 = bn 


with respect to a,, b,, \, where \ is a Lagrange multiplier. The resulting con- 
ditions for a, and b,, determining 7 yz, can be written as follows: 


Snsa(dn), 
ML: 
f = «. 


These conditions are equivalent to those given, for example, in the book of 
Anderson and Bancroft [1, p. 111]. Table 678 shows a, (the top figure) and 
b, for n=2 (1) 29, and e=.90, .95, .99, .995, .999. 

Neyman [5, p. 377] defined an interval I to be unbiased if 


P, covers S (2) 


I 


for all values of u, o?, and o’?. The “shortest” unbiased interval is that interval 
Isv which satisfies (2) and for which the left member is a minimum uniformly 
in p, o?, and o’?. In the Neyman paper cited above it i. also shown that a single 
confidence interval may be obtained from a family of acceptance regions of 
the hypotheses H:c?=o"*. In particular the interval associated with the family 
of uniformly most powerful unbiased tests of these hypotheses against the re- 
spective alternatives o? 0’? is the interval J sv. It is well known that an arbi- 
trary acceptance region of this family is given by the sample points which 
satisfy the condition 


1 
(«. X)?s bn) ; (3) 


see for example Lehmann [4, Sect. 5.3]. Following the procedure outlined by 
Neyman [5] one finds then that Jsy has the form (1). In order to determine 
a, and b, we observe that the acceptance region for the test of H:c?=0’* has 
an associated operating characteristic function 


= f fa(x)dz. (4) 


A necessary condition for the test to be unbiased is that A’(c’?)=0, which 


- 
q 
>: 
: 
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together with A(o’*) =e shows, in confidence interval language, 
= 


f 


These conditions were introduced by Neyman and Pearson [6, pp. 18, 19, 25] 
and were used by Rao [7]. Table 680 shows the a, and b,, values for Jsv, again 
with a, the upper figure and b, the lower figure. 

One can see after a little calculation that the likelihood ratio principle, ap- 
plied to a two-sided test of c?=0’, leads to 


= 
f 


The “equal tails” interval is given by 


= — 0), 


ET: 
f = «. 


I 


3. DISCUSSION AND EXAMPLES 


Additional conditions imposed on confidence intervals, beyond their having 
confidence coefficient ¢, usually fall under two general headings, those that deal 
with minimizing the length, and those that deal with minimizing probabilities 
of covering false values (ordinarily along with the assumption of unbiasedness). 
The first approach is a matter of obtaining the greatest possible precision, and 
is analogous to searching for point estimates with small dispersion, while the 
second approach is formally identical with the theory of testing hypotheses. 

Optimal confidence intervals of the two types described above can coincide 
in a given problem, as in the case of estimating the mean of a normal distribu- 
tion with known variance. Nevertheless, in general the connection between 
them is obscure, and as far as we know has not been investigated. 

In the problem at hand no method is known for finding the confidence inter- 
val with shortest length. However, basing the confidence limits on the suf- 
ficient statistic (X, >>(X;—X)*) constitutes no loss of generality. Moreover, 
the further assumption that the limits remain unchanged upon the addition of 
a constant to all observations seems reasonable and leads to intervals depending 
only on >\(X;—X)*. The last restriction, to multiples of }>(X;—X)?, is 
admittedly arbitrary. Denoting }>(X;—X)* by S*, one might pose the follow- 
ing questions: 

1) Does the interval of shortest length based on (X, S?) depend only on S?? 

2) Among those intervals based only on S? is the interval of shortest length 

necessarily of the form 


2 We would like to thank the referee for bringing this interesting fact to our attention. 
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1 1 
S?, — 
bn An 

The confidence interval Jsy was obtained under no restrictions beyond that 
of its being the “shortest” of the unbiased intervals. Two other approaches will 
also lead to Jsv. First, if we assume invariance of the confidence limits with 
respect to the translation group mentioned above, and consider uniformly 
most powerful invariant tests (see Lehmann [4, Section 5.5]), instead of uni- 
formly most powerful unbiased tests, we will obtain Jsy. Secondly, if we start 
with an interval of type (1), and then minimize the ratio of the right endpoint 
to the left endpoint, rather than length or probability of false covering, I sv 
will again appear.? 

Several observations may be made by way of comparison between the 
various intervals mentioned. It can be seen, for example, that Jsy is.the unique 
unbiased interval of type (i), and hence that Iz, x2, and Igr are all biased. 
The length of sy can, however, be considerably greater than that of Jz. Jer 
is computationally convenient to use but can be even longer than J sv, and is 
thus not optimal in any sense discussed here. There appears to be nothing to 
recommend J zp. Finally, all intervals are equivalent for large n. 

There follow two examples, corresponding to values of n near the beginning 
and the end of the tables. 

Example 1: A sample of six observations was obtained from the distribution 
9(5, 4), using Dixon and Massey’s table of 91(0, 4) random numbers [2, Table 
A-24]. These observations are: : 


2.825, 4.898, 6.407, 4.917, 6.636, 0.612. 
= 4.3825, — #)? = 26.3708, n = 5. 


For the confidence coefficient ¢=.90 results may be summarized as follows: 


Type of 


Interval Observed Value Length Ratio 


Iu (1.395, 16.546) 15.151 11.861 
Isu (2.119, 19.466) 17.347 9.186 
ler (2.382, 23.031) 20.649 9.669 


Example 2: Another sample was selected from the % (5, 4) population with 
sample size N =30. The actual sample values are omitted. 


= 5.3064, >> (a; — = 103.5486, n = 29. 


Following is a table which summarizes the results in this example for e=.90: 


Type of 


Interval Observed Value Length Ratio 


Tux (2.202, 5.419) 3.217 2.461 
Isuv (2.381, 5.703) 3.322 2.395 
Ter (2.433, 5.848) 3.415 2.404 


| | 
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TABLE 678. DIVISORS FOR THE CONFIDENCE INTERVAL OF 
MINIMUM LENGTH,‘ 


x): 


CONFIDENCE COEFFICIENT ¢, n=N—1 DEGREES OF FREEDOM, 
a, IS THE UPPER FIGURE, AND }, IS THE LOWER FIGURE 


-900 -950 -990 -995 -999 
n 

2 .2104 .0201 .0100 .0020 
18.0077 21.4812 29.1362 32.3240 39.5708 
3 .5821 .3513 -1148 .O717 .0244 
17.6381 20.7437 27.5102 30 .3027 36.5959 
4 1.0561 - 7082 .2969 -2069 .0908 
18.1062 21.0632 27 .4603 30.0848 35.9845 
5 1.5938 1.1392 .5534 -4113 -2102 
18.9081 21.8001 28 .0269 30.5697 36.2654 
6 2.1750 1.6233 .8700 -6747 -3806 
19.8739 22.7410 28.8928 31.3966 36.9947 
7 2.7883 2.1473 1.2350 - 9871 -5979 
20.9303 23.7944 29.9229 32.4106 37.9541 
8 3.4262 2.7027 1.6397 1.3406 -8560 
22.0405 24.9147 31.0507 33 .5358 39.0631 
9 4.0840 3.2836 2.0775 1.7288 1.1499 
23.1844 26 .0769 32.2397 34.7308 40.2631 
10 4.7584 3.8855 2.5434 2.1469 1.4755 
24.3498 27 .2662 33.4685 35.9714 41.5223 
11 5.4467 4.5054 3.0334 2.5906 1.8287 
25 .5294 28 .4733 34.7240 37 .2430 42 .8238 
12 6.1472 5.1409 3.5447 3.0573 2.2078 
26.7180 29.6920 35.9963 38 .5330 44.1445 
13 6.8583 5.7899 4.0744 3.5439 2.6086 
27.9126 30.9184 37.2809 39.8378 45.4880 
14 7.5788 6.4510 4.6205 4.0483 3.0296 
29.1109 32.1497 38 .5733 41.1517 46.8441 


« For the meaning of “minimum length” in this context see Section 3. 
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TABLE 678—Continued 


-900 -950 -990 -995 -999 


8.3078 7.1227 4.5685 3.4676 
30.3113 33.3842 42.4732 48.2150 


9.0446 7.8043 . - 1040 3.9248 
31.5125 34.6197 ‘ 49.5766 


9.7883 4947 6523 4.3954 
32.7139 8560 50.9511 


10.5385 . 1932 .2128 4.8806 
33.9148 .0919 .4465 52.3245 


11.2947 -8991 7846 5.3786 
35.1148 .3271 5. -7723 


12.0563 6119 
36.3137 .5611 .0974 


12.8230 .3310 -9580 
37.5112 7936 .4216 


.5946 .0561 : .5588 
-7070 .0243 ‘ 7426 


.3706 . 7868 .1679 
.9011 2532 53.0616 


- 1508 .5227 9.7845 
.0935 -4802 54.3793 


. 2636 ‘ 10.4088 
. 2840 -7051 ‘ 55.6935 


. 7230 .0090 11.0396 
.4728 .9281 57.0065 


.5145 7587 11.6764 
-6598 .1491 58.3186 


.3095 .5128 12.3211 10.3146 
-8446 49.3675 59.6230 65.9955 


19.1076 17.2706 ‘ 12.9699 10.9003 
47.0279 50.5843 60.9295 67 .3589 


679 

55.0743 
56.4507 
38 57.8190 
39 59.1857 
‘| 
41 60.5545 

42 61.9157 
26 16 9.1580 
| 
a 44 64.6514 
45 
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TABLE 680. DIVISORS FOR NEYMAN’S “SHORTEST” UNBIASED 
CONFIDENCE INTERVAL, 


n ial n tal 


CONFIDENCE COEFFICIENT e, n=N—1 DEGREES OF FREEDOM, 
a, IS THE UPPER FIGURE AND b, IS THE LOWER FIGURE 


950 -995 .999 


.0847 ‘ ‘ -0018 
.5303 : 18.4677 


.1915 : 20.5244 


.6070 -0831 
-8024 22.4855 


- 9892 ‘ 1933 
. 3686 ‘ 24.3799 


.4250 -3519 
26.2160 


-9026 ‘ -5548 
-3923 28 .0053 


-4139 ‘ 7972 
-8604 - 7547 


‘ .0743 
-3050 -4709 


.5162 ‘ .3827 
- 7289 1543 


806 .7188 
.1347 .8097 


-7005 ‘ -0790 
.5247 : -4463 


-4609 
- 9004 d -0646 


5.9477 .8650 
27.2630 -6507 


680 
7.8643 9 
9.4338 11 
| 
4 
10.9583 12 
5 1.3547 
12.4424 14 a 
reat. 
6 1.8746 1 
13.8922 15 
7 2.4313 1 nae 
15.3136 17 
8 3.0173 2 os 
16.7108 18 ae 
9 3.6276 2 
18.0874 20 
10 4.2582 3 
19.4463 21 
4.9063 4 
20.7895 23 
22.1190 24 
13 6.2462 5 
23 .4362 25 
14 6.9347 
24.7423 


TABLE 680—Continued 
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.900 


-950 


7.6340 
26.0385 


8.3427 
27 .3257 


9.0603 
28 .6047 


9.7859 
29.8759 


.5188 
-1401 


- 2586 
.3978 


.0046 
-6494 


.8954 


.5138 
. 1362 


.2764 
.3719 


.0437 
-6030 


-8155 
-8296 


.5917 
-0521 


.3718 
.2706 


1558 
43.4855 


6.5909 
28.6141 


.9546 


-9099 


. 5842 
-6072 


2670 
-9209 


-9579 
. 2267 


-6562 
.5253 


.3614 
.8176 


.0730 
. 1036 


-7908 


.5142 
.6581 


. 2430 
.9273 


.9769 
.1916 


.7155 
.4514 


.4586 
7069 


10.2072 
46.7057 


10.8462 
48 .0645 


11.4923 
49.4157 


12.1447 
50.7610 


12.8033 
52.1004 


13.4674 
53.4350 


59.1196 


9.9309 
60.5496 


10.5035 
61.9829 


.990 .995 .999 
n 

15 4.8531 4.2965 3.2872 

34.0970 36.3114 41.2209 
16 5.4041 4.8100 3.7248 
29 35.5402 37.7927 42.7826 
17 7 5.9681 5.3373 4.1775 
i 31 36.9711 39.2609 44.3309 

32 38.3896 40.7147 45.8546 
19 7.1314 6.4300 5.1272 
31 33 39.7984 42.1590 47.3738 
20 7.7290 6.9938 5.6218 
32 35 41.1966 43.5912 48.8733 

21 8.3360 7.5671 6.1281 
33 36 42.5856 45.0132 50.3610 
22 12 11 8.9515 8.1496 6.6428 
ior 34 37 43.9672 46 4282 51.8481 
23 9.5752 8.7410 7.1671 
36 39 45.3409 47 8348 53.3266 

24 14 12 9.3416 7.7043 
37 40 49.2305 54.7826 

38 41 50.6209 56.2408 

39 42 52.0024 57.6820 
41 44 53.3778 
42 45 54.7466 

46 56.1114 
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4. METHOD OF CONSTRUCTION FOR TABLES 


The difficulties of calculation stem from the fact that there are many pairs 
of numbers (a,’, b,’) which satisfy the first condition f,42(@,’) =fn+e(bn’) with 
high precision, and which come sufficiently close to satisfying the condition 
Sef,(x)dz=e that their errors go unnoticed. Suppose, for example, that a,’ 
and b,’ are thought to satisfy the two equations, but in reality a,’ is slightly in 
error by being too small. If a,’ and 6,’ satisfy the first equation with high pre- 
cision, then b,,’ may be vastly in error,’ but the errors in both numbers might not 
be found when the second equation is used as a check, since small variation in 
the lower limit and large variation in the upper limit will produce small varia- 
tion in the resulting integral. The method of calculation selected used high speed 
computing devices, and consisted in dividing the calculations into two stages. 
First, a set of triples (ani, bn:, €:) was generated for each n by choosing a,;, deter- 
mining b,,; by Newton’s method, and e; by integration. Then for each confidence 
level «, a value a, was found from couples (a,;, ¢;) for nearby ¢; by Aitken’s 
method of five-point interpolation [3, p. 200]; b, was then obtained from a, by 
Newton’s method. Following this the resulting triple (a,, b,, €) was checked by 
integration. For even values of n the integration was based on a particular 
integral, and for odd values of n an improved trapezoidal rule was employed. 

A check on the accuracy for Tables 678 and 680 showed that 


| (Qn) | < 107 


for k=2, 4 and all n. Also 


< 107% 


f 


for all even n and for selected odd values of n including small sample sizes for 
all confidence levels. These bounds for error apply of course to the 8 digit 
numbers carried during the calculations (frequently amounting to 15-decimal 
numbers), and not to the final rounded off values for a, and b, which were 
entered in the tables. 
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EXTENDED TABLES OF THE PERCENTAGE POINTS 
OF +DISTRIBUTION 


Enrico T. FEDERIGHI 
Bendiz Radio 


In using Student’s é-distribution in testing component parts a need 
for extending the table of upper percentage points was revealed. The 
method of calculation of these percentage points is presented and a 
table containing these results is given. 


INTRODUCTION 


ELIABILITY research at Bendix Radio has revealed a need for tables of ex- 

treme percentage points of Student’s distribution. The major use of the 
table is in testing to failure of component parts where repeated stressing would 
lower the apparent mean failure stress. No better assumption can be made 
than that the universe of parts from which the sample was drawn is normal. 


1, METHOD OF COMPUTATION OF TABLES 


These tables were calculated with the aid of a G-15A Bendix electronic com- 
puter. The upper portion of the table (degrees of freedom <30) was obtained 
by integration of the density function of Student’s ¢-distribution, namely 


S(t, n) = 
2 


where n is the number of degrees of freedom. 
For even n<30 the integration can be done directly. In fact if we put 


g(t, n) = f f(u, n)du 
we have 
1 w 
g(t, n) ~ “) where v = + 
v 


and w is the polynomial in ¢ which consists of only those terms of the binomial 
expansion of v which contain integral powers of ¢. 
For actual computations we wrote 
w v? — w? 


v v(v + w) 


and computed v?—w? on a desk calculator since v? is a much more convenient 
quantity than v to work with. This avoided taking the difference of two very 
nearly equal quantities on the computer. Also, it made possible the fourtcen- 
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STUDENT’S t-DISTRIBUTION 
25 10 05 025 01 005 0025 001 
D.F. 

1 .000 3.078 6.314 12.706 $1,821 63.657. 127.321 318.309 
2 .816 1.886 2.920 4.303 6.965 9.925 14.089 22.327 
3 765 1.638 2.353 3.182 4.541 5.841 7.453 10.214 
4 741 1.533 2.132 2.776 3.747 4.604 5.598 7.173 
5 .727 1.476 2.015 2.571 3.365 4.032 4.773 5.893 
6 .718 1.440 1.943 2.447 3.143 3.707 4.317 5.208 
7 711 1.415 1.895 2.365 2.998 3.499 4.029 4.785 
8 .706 1.397 1.860 2.306 2.896 3.355 3.833 4.501 
9 .703 1.383 1.833 2.262 2.821 3.250 3.690 4.297 
10 -700 1.372 1.812 2.228 2.764 3.169 3.581 4.144 
11 .697 1.363 1.796 2.201 2.718 3.106 3.497 4.025 
12 695 1.356 1.782 2.179 2.681 3.055 3.428 3.930 
13 694 1.350 1.771 2.160 2.650 3.012 3.372 3.852 
4 .692 1.345 1.761 2.145 2.624 2.977 3.326 3.787 
15 .691 1.341 1.753 2.131 2.602 2.947 3.286 3.733 
16 .690 1.337 1.746 2.120 2.583 2.921 3.252 3.686 
17 .689 1.333 1.740 2.110 2.567 2.898 3.223 3.646 
18 .688 1.330 1.734 2.101 2.552 2.878 3.197 3.610 
19 .688 1.328 1.729 2.093 2.539 2.861 3.174 3.579 
20 .687 1.325 1.725 2.086 2.528 2.845 3.153 3.552 
21 .686 1.323 1.721 2.080 2.518 2.831 3.135 3.527 
22 .686 1.321 1.717 2.074 2.508 2.819 3.119 3.505 
23 685 1.319 1.714 2.069 2.500 2.807 3.104 3.485 
24 .685 1.318 1.711 2.064 2.492 2.797 3.090 3.467 
25 684 1.316 1.708 2.060 2.485 2.787 3.078 3.450 
26 .684 1.315 1.706 2.056 2.479 2.779 3.067 3.435 
27 .684 1.314 1.703 2.052 2.473 2.771 3.057 3.421 
28 .683 1.313 1.701 2.048 2.467 2.763 3.047 3.408 
29 .683 1.311 1.699 2.045 2.462 2.756 3.038 3.396 
30 .683 1.310 1.697 2.042 2.457 2.750 3.030 3.385 
35 .682 1.306 1.690 2.030 2.438 2.724 2.996 3.340 
40 .681 1.303 1.684 2.021 2.423 2.704 2.971 3.307 
45 .680 1.301 1.679 2.014 2.412 2.690 2.952 3.281 
50 .679 1.299 1.676 2.009 2.403 2.678 2.937 3.261 
55 .679 1.297 1.673 2.004 2.396 2.668 2.925 3.245 
60 .679 1.296 1.671 2.000 2.390 2.660 2.915 3.232 
70 .678 1.294 1.667 1.994 2.381 2.648 2.899 3.211 
80 .678 1.292 1.664 1.990 2.374 2.639 2.887 3.195 
90 .677 1.291 1.662 1.987 2.368 2.632 2.878 3.183 
100 .677 1.290 1.660 1.984 2.364 2.626 2.871 3.174 
200 .676 1.286 1.652 1.972 2.345 2.601 2.838 3.131 
500 .675 1.283 1.648 1.965 2.334 2.586 2.820 3.107 
1,000 .675 1.282 1.646 1.962 2.330 2.581 2.813 3.098 
2,000 675 1.282 1.645 1.961 2.328 2.578 2.810 3.094 
10,000 .675 1.282 1.645 1.960 2.327 2.576 2.808 3.091 

° .674 1.282 1.645 1.960 2.326 2.576 2.807 3 
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P 

.0005 

_$__§€£§_ __— 

ee 1 636.619 

jf 2 31.598 

3 12.924 

4 8.610 

5 6.869 

a 6 5.959 6.788 8.025 9.082 

be.’ 7 5.408 6.082 7.063 7.885 

hs 8 5.041 5.618 6.442 7.120 

we 9 4.781 5.291 6.010 6.594 

— 10 4.587 5.049 5.694 6.211 

7.098 

Pie 12 6.756 

13 6.501 

14 6.287 

15 6.109 

16 5.960 

17 5.832 

18 5.722 

19 5.627 

20 5.543 

aie 21 3.819 4.110 5.469 

ae 22 3.792 4.077 5.402 

core 23 3.768 4.048 5.343 

a 24 3.745 4.021 5.290 

ey 25 3.725 3.997 5.241 

324 4.587 5.197 

see 299 4.558 5.157 

ess) 275 4.530 5.120 

ac 254 4.506 5.086 

234 4.482 5.054 

cee 153 4.389 4.927 

094 4.321 4.835 

049 4.269 4.766 

014 4.228 4.711 

986 4.196 4.667 

962 4.169 4.631 

ee: 926 4.127 4.576 

899 4.096 4.535 
ae 878 4.072 4.503 

“a 862 4.053 4.478 

a .789 3.970 4.369 

3.922 4.306 

ie: 733 3.906 4.285 | 

.726 3.898 4.275 

| .720 3.892 4.267 

71° 3.801 4.265 
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place accuracy which was necessary and which could not be obtained on the 
computer. 
Example: For n=4 we have v=(t?+4)*/?, w=#+6t and v?—w*= 12+ 64. 
Hence 
6t? + 32 
+ + 4)? + + 


g(t, 4) = 


The percentage points for g(t, 4) were then calculated using inverse quadratic 
interpolation on its logarithm. 

For odd n<10 the integration yields a formula containing are tangents. 
These were calculated and the percentage points obtained as in the even case. 
The finite series for both cases above are given by Dunnett [1]. For odd n 
between 10 and 30 asymptotic interpolation was used based on the formula [1 ] 


n2 


H 
(2) 
n 


where z is the corresponding percentage point of the normal distribution, 
+1 + 1)(z? +3 
= and H,(e) == = 


The lower portion of the table n>50 was obtained from the asymptotic 
expansion 


G,(t) G 


ni 
where 

and the G,(t) are polynomials in odd powers of ¢ up to 4¢—1. 

The first seven G;(t) were calculated using the method of R. A. Fisher [2]. 
The first five given there were checked and an error was discovered in G;,(t). 
The seventh coefficient should be 915 instead of —915. 

The middle part of the table (n=35, 40, 45) was by far the most difficult to 
compute. Using the values for n=30 and n=50, the value for n=40 was ob- 
tained by asymptotic interpolation. The method used can be seen from the fol- 
lowing example taken from the case P = .00001. 


f(t, = 


(1) (2) (3) (4) (5) (7) 

n (2)-(2)0 | (1).(3) Logio (4) (1). (6) 
30 5.05409 78920 | 23.676 | 1.37431 1.9020 
40 4.83509 .57020 | 22.808 | 1.35808 1.8867 
50 4.71104 44615 | 22.308 | 1.34846 1.8775 
0 4.26489 0 20.460 | 1.31091 1.8372 


| 
| | | | | 
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In the above table H,(4.26489)=20.460, H2(4.26489) =(1.8372) (20.460) 
(In 10) and 30 (1.9020) +50 (1.8775) =80 (1.8867). 

The calculation process was then reversed to obtain the second column which 
was entered in the table as 4.835. 

These tables have pointed out errors in the Biometrika Tables for 2 and 3 
degrees of freedom and P=.001. There are also errors for 3, 5, and 7 degrees 
of freedom and P=.0005 in R. A. Fisher and F. Yates’ Statistical Tables for 
Biological, Agricultural and Medical Research. 
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PERCENTAGE POINTS FOR THE DISTRIBUTION 
OF OUTGOING QUALITY 


G. P. Srecx anv D. B. OwENn 
Sandia Corporation 


When a consumer buys only a few lots from a producer according 
to some scheme based on the results of sampling inspection, he should 
not depend on the average outgoing quality limit as being the upper 
bound on the defectiveness of the lots he buys. The consumer should, 
in fact, examine the percentage points of the distribution of outgoing 
quality. Approximate percentage points for the distribution of outgoing 
quality for a finite number of lots are found in this paper. 


1. INTRODUCTION 


HE purpose of this paper is to focus attention on the random nature of the 
2 ae, outgoing quality of a finite number of lots, to examine its distribu- 
tion, and to approximate the per cent points of its distribution. 

In sampling inspection, the Average Outgoing Quality, or AOQ, is defined 
to be the expected fraction defective in an inspected lot. (The assumptions 
under which this expectation is computed are given at the end of this section.) 
If the maximum of this AOQ function is taken over all values of incoming frac- 
tion defective, the Average Outgoing Quality Limit, or AOQL, is obtained. 
The AOQL is thus a number such that the expected fraction defective in an 
inspected lot is never more than this number regardless of the fraction defective 
in the lot prior to inspection. 

The concepts of AOQ and AOQL are both very useful for the general descrip- 
tion of sampling plans and Dodge and Romig [3] have given extensive tables of 
sampling plans indexed according to their AOQL’s. But too often the fact that 
the AOQ is the mean of a certain random variable is overlooked. Moreover, 
there is an unfortunate tendency among the uninitiated to consider the AOQL 
as a limit which bounds the fraction defective rather than the expected fraction 
defective. In order for the AOQ and AOQL to be an adequate description of a 
sampling plan, it is necessary to consider an indefinitely long sequence of lots so 
that the random nature of outgoing quality is submerged by the law of large 
numbers. A long sequence of lots is a reasonable assumption from the point 
of view of a producer inspecting his product prior to marketing because usually 
his primary aim is to produce just such a sequence. Such a sequence of lots is 
much less reasonable as soon as sampling plans are considered from the point 
of view of a consumer inspecting in order to decide what lots to purchase. The 
consumer generally purchases a relatively small number of lots and doesn’t 
care what the producer’s quality is in the “long run”—he only wants to know 
the quality of the small segment of lots he has purchased, and it is the distribu- 
tion of this quantity that is the subject of this paper. 

For example, suppose that a collection of 20 lots, each consisting of 12 items, 
is submitted for inspection to a sampling plan whose sample size is 5 and whose 
acceptance nuraber is 0. The AOQL of such a plan is 5.5%. However, if the in- 
coming quality gives an AOQ equal to the AOQL, then the probability is ap- 
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proximately 10% that such a collection of lots has an average per cent defective 
after inspection exceeding 7.5%. 

Throughout this paper the usual assumptions and definitions are made con- 
cerning the AOQ and AOQL, i.e., 

(i) rejected lots have been 100-per cent inspected (screened) and all de- 
fectives have been replaced with acceptable items, 

(ii) accepted lots are considered to contain the same defectiveness as they 
had before inspection even though some defectives. may have been 
replaced [2], 

(iii) inspection is assumed to be perfect, that is, a unit is called defective if 
and only if it is defective, 

(iv) the AOQ is the average fraction defective in all lots, both accepted and 

rejected, 

(v) the AOQL is the maximum value of the AOQ’s. 

Since the term “Average Outgoing Quality” has already been applied to the 
“long-run” average, the actual mean outgoing quality, where the mean is taken 
over a finite number of lots, will be called simply “outgoing quality” and the 
fact that it is an average will be suppressed. Examples are given at the end of 
the paper showing per cent points for the distribution of outgoing quality of 20 
lot groups under two assumptions regarding the incoming quality. 

2. DISTRIBUTION OF OUTGOING QUALITY WITH CONSTANT INCOMING QUALITY 


For the following discussion, let 
L=number of lots submitted, 
N =number of items in each lot, 
n=sample size for each lot, 
c=acceptance number of sampling plan being used, 
k;=proportion of defectives in the i-th lot, i=1, 2,---, L, 
p(k) = probability of accepting a lot if it has a proportion of defectives equal 
to k, 
p(ki) = pi, 
and 


1 
X,= 


where Y; is one or zero according as the 7-th lot is accepted or rejected. 

The random variable X; measures the observed outgoing quality of the L 
submitted lots. Define M, to be the 100a-per cent point of the distribution of 
i.e., P(X1<M,) =a. 

In case the lot quality is constant from lot to lot, i.e., the k’s are equal (which 
requires that the p’s be equal also) then the expected value of X,=kp, i.e., the 
expected value of X, is equal to the AOQ of the plan and (L/k)X, has a 
binomial distribution with mean Lp and variance Lp(1—p). Hence if C, is a 
100a-per cent point of this binomial distribution, then 


M, = (k/L)C. (1) 
is a 100a-per cent point of the distribution of Xz. 
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3. DISTRIBUTION OF OUTGOING QUALITY WITH VARIABLE 
BUT KNOWN INCOMING QUALITY 


More often than not the proportion defective will vary from lot to lot. The 
conditional mean and variance of Xz given ki, ke, -- - , ky are given by 
L 
A = EX, = Di kop./L 
( 2) 


L 
B? = Var = Do — pi)/L*. 
t=1 


The conditional distribution of Xz given values of the k’s may be approximated 
by a normal distribution and 


M,=A+tB (3) 


where A and B are defined by (2) above and where ¢, is the 100a-per cent 
point of the standardized univariate normal distribution, that is, ¢, is a number 


such that 
1 ta x 
exp (- = a. 
J 2 


In most instances, this procedure will give acceptable results when L>20 
regardless of the sample size per lot. The first example in section 5 shows the ap- 
plication of this method to a particular problem. 


4, DISTRIBUTION OF OUTGOING QUALITY WITH VARIABLE AND 
UNKNOWN INCOMING QUALITY 


It is seldom realistic to assume that the proportions of defectives are the 
same for all lots or that they are known. The vagaries of manufacturing being 
what they are, it is reasonable to assume that the proportion of defectives in 
a lot will vary from lot to lot, and generally this variation will exhibit some 
statistical regularity. If this is the case, then the proportion of defectives is 
itself a random variable with a distribution, and the following method uses 
this fact. 

If the k’s can be assumed to have a sampling distribution with mean yu and 
variance o*, and are assumed independent, then the unconditional mean and 
variance of X;, (A* and B*?, respectively) are obtained [5] and [1] from the 
conditional mean and variance given in (2) as follows: 


A* = EE(X,|k) = EA = E[kp(k)] 
B*? = E Var(X,| k) + Var E(X1| k) = EB? + Var A 
= (1/L)[E{k*p(k) [1 — p(k)]} + Var[kp(k)]]. 


Using the approximations [4] 
Ef(X) = f(EX) + ry (EX) | 


Var f(X) & o?[f’(EX) |, 
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one finds 
A* up(u) + (02/2) [2p’(u) + up” (u) | (4) 
(1/L) [u*p(u) [1 — p(u)] + (0/2)G(u)], 


where expressions for p(u), p’(u) and p’’(u) are discussed below and 


d[kp(k)] 


= 2p(u) + 4up’(u)[1 — p(u)] + u2p’(u) [1 — 2p(u)]. 


d? 


It should be noted that although the operating-characteristic curve, p(k), is 
defined in any one instance for only discrete values of k, it can be considered 
as continuous for mathematical purposes. For example, if the acceptance num- 
ber of the sampling plan is zero, then (note that k is the proportion defective 
and not the number of defectives), 


(N—n)! [NU 
N! 


p(k) = ai, 


where a;=1/[N(1—k) —n—i]. In this notation, the derivatives are as follows: 


= — No) 


pk) = Nk) Sew, 


i=l jal 


(h) k 
0 h>n. 


If the values of the mean and variance of X; which incorporate knowledge 
of the sampling distribution of the k’s, are used in the normal approximation 
to the distribution of Xz, it follows that 


M, = A* + t.B* (5) 


where A* and B* are defined by (4). The second example of section 5 shows 
how this method is applied to a particular problem. 


5. EXAMPLES 


The following examples are merely illustrative and are not of special im- 
portance in themselves. 
As a first example, consider the vase in which N =12, n=5, c=0, L=20, 
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ki=k,= - - - =k,=k. In Figure 693, the 10-, 30-, 50-, 70-, and 90-per cent points 
of the distribution of outgoing quality as computed from (3) are given. Note 
that the AOQ function coincides with the 50-per cent points in this case. 

As a second example, consider the same sampling plan, N =12, n=5, c=0, 
and L=20 as above for the case in which the k’s are from a distribution with 
a mean proportion defective equal to u and standard deviation o=y(1—4)/4. 
While this is an artificially constructed example, this form of ¢ was chosen be- 
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Fic. 693. Per cent points for the distribution of the actual outgoing quality of a group 
of 20 lots, each containing 12 units, which were equally defective prior to inspection by a 
sampling plan for which n=5 and c=0. The OC-curve, p(k), is also given. 


cause it is zero where it should be zero, is an increasing function of yu over the 
range of interest, and has a maximum of about five per cent. Figure 694 shows 
the normal approximation to the 10-, 30-, 50-, 70-, and 90-per cent points of 
the distribution of mean outgoing quality given by (4) and (5) for this case. 
Again note that the AOQ function coincides with the 50-per cent points. Note 
also that the changes in the mode! from the first example (Figure 693) to the 
second example (Figure 694) make only slight changes in the graphs. 

Since ignoring the terms of (4) involving o changes the graphs of the figure 
illustrating this example by at most 0.2 per cent (in an absolute sense) it is felt 
that the errors in using the approximations given in (4) are negligible for most 
practical purposes. 
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Fia. 694. Per cent points for the distribution of the actual outgoing quality of a group 
of 20 lots, each containing 12 units, whose initial compositions, k;, are obtained by sam- 
pling from a population with u,=y and o,=y"(1—y)/4. The sampling plan is given by 
n=5 and c=0. The approximate OC-curve, p(z), is also given. 

(Note: The true OC-curve is Ep(k) and would lie a little above p().) 
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squares.” His main fields of interest are the Bible, Talmud and Rabbinic Law, graphic 
methods in pure mathematics and statistics, geometric and visual optics, application of 
electronic and optical instruments in medical research, general medicine and opthal- 
mology, and tumor statistics. About this article he says, “The writing of the present 
article was encouraged by personal conversations with Dr. Jacob Mincer of the City 
College of New York.” 

RONALD HANNA BEATTIE, 56, oas been Chief, Bureau of Criminal Statistics, 
Department of Justice, State of California, since 1945. He received his B.A. (1926), J.D. 
(1928), and his M.A. (1932) in sociology, all from the University of Oregon. He has been 
research assistant, University of Oregon School (1929-32); research associate, Bureau of 
Public Administration, University of California, Berkeley (1932-36); Regional Director, 
Attorney General’s Survey of Release Procedures (1936-37); Statistician for criminal 
statistics, United States Bureau of Census, Washington, D. C. (1937-40); and Statis- 
tician, Administrative Office, United States Courts, Washington, D. C. (1940-45). Be- 
sides various other articles relating to criminal statistics he co-authored Survey of Admin- 
istration of Criminal Justice in Oregon (1932) with Wayne L. Morse; and wrote Criminal 
Judicial Statistics for California (1936). His fields of interest are statistics and research, 
law enforcement, administration of criminal justice, and corrections. 

GEORGE EDWARD PELHAM BOX, 40, was appointed Professor of Statistics 
at the University of Wisconsin this year. He received his B.Sc. (1948) in Mathematical 
Statistics, and his Ph.D. (1952) in Mathematica! Statistics both from the University of 
London. He was Head of Statistical Research Section (1948-56) of Imperial Chemical 
Industries, Manchester; Visiting Research Professor, University of North Carolina (1953); 
and Director, Statistical Techniques Research Group, Princeton University (1956-59). 
He has written numerous works related to his main interest which is design and analysis 
of experiments in the industrial application of statistics. About the present article he says, 
“At the time this article was conceived most of the work on experimental design, except 
that of Hotelling, had been concerned with finding arrangements that would minimize 
variance on the assumption that the mathematical model was exactly correct. If we ask 
a silly question we can expect to get a silly answer and the (silly) answer we get from pro- 
ceeding in this way includes the proposition, for example, that if we have N experiments 
at our disposal, and we want to fit a straight line we should run half of these with the 
independent variable set at the highest possible value and the other half with the inde- 
pendent variable set at the lowest possible value. When an experimenter wants to fit a 
straight line it is usually because he thinks it likely, although not certain, that a straight 
line might provide a reasonably close graduation over the region in which he is interested 
to the (almost certainly curved) true relationship. In these circumstances the typical 
experimenter would usually want to make a series of runs which in some sense “covered” 
this region of interest. It should be noted that the region of interest at a particular stage 
of experimentation will not necessarily or even usually correspond to the region of oper- 
ability, that is the region over which experiments of the kind contemplated could be run. 
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A design which “covered” the region of interest would usually be better than one which 
had observations only at the boundaries of the region of operability (1) because in the 
region of interest the straight line fitted to such points would probably more closely gradu- 
ate the true underlying curve and (2) because the observations made at intermediate 
points would warn the experimenter when linear graduation was inadequate. This is a 
special instance of the more general proposition that we cannot expect an experimenter 
ever to supply an exact model. He has to some extent built up the model as he goes along. 
Statistical procedures of analysis and design are required therefore which work not only 
on the assumption (which is never true—that the model is exactly right—but (1) behave 
reasonably well when departures from the model of the kind likely to be met in practice 
occur and (2) give warning of serious discrepancies from the assumed model. 

“Dr. Draper was spending some time at the Statistical Techniques Research Group 
at Princeton in the summer of 1957 and this paper is the outcome of the work done on the 
problem of applying the above ideas to the case of fitting multivariable polynomials by 
least squares.” 

WILLIAM AITKEN STEWART CROMARTY, 33, has been Assistant Professor, 
Department of Agricultural Economics, Michigan State University, since 1955. He re- 
ceived his B.Sc. (1951) in agriculture from Ontario Agricultural College, and his M.Sc. 
(1953) and Ph.D. (1957) both in agricultural economics from Michigan State University. 
He was Analytical Statistician for the United States Department of Agriculture (1954). 
He has written two technical bulletins for Michigan State University, Agricultural Ex- 
periment Station, The demand for farm machinery and tractors and The demand for farm 
trucks. His two articles in the Journal of Farm Economics have been “Changing relation- 
ships between agriculture and the national economy” and “The farm demand for tractors, 
machinery and trucks.” His field of interest is in the application of statistical techniques 
to measurement problems within agriculture. 

“The present article grew out of a meeting between Karl Fox, now of Iowa State 
University, Lawrence Klein, now with the Wharton School of Finance, and Glenn Johnson 
of Michigan State University. The problem was to examine agriculture in more detail 
than had Klein and Goldberger. Fox who was with the USDA at that time consented to 
my working with the Agricultural Marketing Service to gain familiarity with the data. 
Richard Foote was very helpful at that time as:were also Daniel Suits and Arthur Gold- 
berger when I returned to the University of Michigan.” 

NORMAN RICHARD DRAPER, 28, has been a Technical Officer (Statistician), 
with Imperial Chemical Industries (Plastics Division), Welwyn Garden City, Herts, 
England, since 1958. He received his M.A. (1954) in mathematics, and his Diploma 
(1955) in mathematical statistics, both from Cambridge University; his Ph.D. (1958) 
in mathematical statistics, from the University of North Carolina. He spent the summer 
of 1955 with Imperial Chemical Industries in Manchester, England, the summer of 1956 
with American Cyanamid Company, New York, and the summer of 1957 with Statistical 
Techniques Research Group, Princeton. This is his first JASA publication. His main 
field of interest is design and analysis of experiments. 

OLIVE JEAN DUNN, 44, has been Assistant Professor of Biostatistics and Assist- 
ant Professor of Preventive Medicine and Public Health, University of California at Los 
Angeles, since 1957. She received her B.A. (1936), M.A. (1951), and Ph.D. (1956), all in 
mathematics from the University of California at Los Angeles. She was Assistant Pro- 
fessor of Statistics, Iowa State College (1956-57). Her “Estimation of the means of 
dependent variables” (Dec. 1958) and “Estimation of the medians of dependent vari- 
ables” have appeared in the Annals of Mathematical Statistics. Her main interest is in bio- 
statistics. About this article she says, “Most of the research for this paper was part of 
my doctoral dissertation. The idea of writing an article for the research worker who 
uses statistical methods was suggested to me by one of the non-statisticians on my doc- 
toral committee at the time of my final examination. In working on the various con- 
fidence intervals for k means, I thought of the Bonferroni inequality ones quite early, 
but since they were so simple I thought they couldn’t possibly be of any use. I spent a 
long time trying to prove that the confidence intervals which would be used in the case 
of independent variables could also be used for dependent variables. After failing to find 
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a general proof for this, I finally noticed that the simple Bonferroni intervals were nearly 
as short.” 

ENRICO THOMAS FEDERIGHI, 32, has been Senior Mathematician for the Johas 
Hopkins University Applied Physics Laboratory, since February, 1958. He received his 
B.A. (1950) from Antioch College, his M.A. (1954) from Johns Hopkins University, and 
his Ph.D. (1957) from Indiana University, all in mathematics. He was with Bendix Radio 
in Maryland, from 1953 to 1957 first in summer positions and later (1956) as Assistant 
Project Engineer, and from 1950 to 1953, was a Junior Instructor at Johns Hopkins Uni- 
versity. His ‘‘The use of chi-square in small samples’ appeared in the American Socio- 
logical Review (Dec. 1950). His interests include finite groups, number theory, statistics, 
combinatorial analysis, and class field theory. 

PETER ENGEL pve JANOSI, 31, has been Economist, General Economics Depart- 
ment, Standard Oil in New York, since 1956. He received his B.A. (1950) from Wesleyan 
University, his M.A. (1951), and his Ph.D. (1956) from the University of Michigan all 
in economics. He was Study Director at the Survey Research Center (1954-56) and mem- 
ber of the Research Seminar for Quantitative Economies (1955-56) at the University of 
Michigan. His previous publications have appeared in the Journal of Marketing and the 
Michigan Business Review. His major interests are in forecasting of business conditions, 
and in empirical verifications of propositions in economic theory. 

NORMAN MAURICE KAPLAN, 42, has been Economist at the RAND Corporation, 
since 1950. He received his B.A. (1939) and M.A. (1947) both in economics from the 
University of Chicago. He was Assistant Professor of Economics, Illinois Institute of 
Technology (1947-50). His main published articles have appeared in the Journal of 
Political Economy. They are, “Investment alternatives in Soviet economic theory” (Apr. 
(1952), and “A comparison of Soviet and American retail prices in 1950” (Dec. 1956) 
with Eleanor S. Weinstein. He contributed “Capital formation and allocation” to Soviet 
Economic Growth (Ed. A. Bergson), Row Peterson, 1953. His interest is in Soviet economy, 
and theory of economic growth. 

GERALD WALTER KLETT, 30, has been a graduate student and research assistant 
at the University of Washington, since 1957. He received his B.S. (1957) in mathematics 
from the University of Washington. His fields of interest are probability and statistics, 
operation and utilization of electronic computing machines. 

MAX RAY MICKEY, JR., 36, has been Statistician, General Analysis Corporation, 
11753 Wilshire Boulevard, Los Angeles 25, California, since October, 1958. He received 
his B.Sc. (1947) in electrical engineering from Virginia Polytechnic Institute; and his 
Ph.D. (1952) in statistics from Iowa State College. He was with the RAND Corporation 
from 1950-52, and 1955-58; and from 1952-55 was Assistant Professor of statistics ai, 
Iowa State College. His principal activity has been with operations research. About the 
present article he says, “The basic elements of the submitted paper were prepared in 1954. 
It grew out of contemplation of the proposition that if a sample is representative of a 
population, one should be able to infer the sampling characteristics of an estimator by 
applying the form of the estimator to subsamples of the given sample. The original 
paper was presented at the regional meeting of the Institute of Mathematical Statistics 
held at Iowa City, Iowa, November, 1954.” 

DONALD BRUCE OWEN, 37, has been Supervisor, Statistical Division 5125, 
Sandia Corporation, since October, 1957. He received his B.S. (1945) and M.S. (1946) 
both in mathematics, and his Ph.D. (1951) in statistics, all from the University of Wash- 
ington. He was Associate (1946-51), then Instructor (1951-52) in mathematics at the 
University of Washington; and Assistant Professor and Research Associate (1952-54) at 
Purdue University. His main published titles are “A double sample test procedure” (Sept. 
1953) and “Tables for computing bivariate normal probabilities” (Dec. 1956), both in 
Annals of Mathematical Statistics, and “A method of computing bivariate normal proba- 
bilities with an application to handling errors in testing and measuring,” The Bell System 
Technical Journal (Mar. 1959). His fields of interest are multivariate normal distributions, 
tabulations of statistical functions, application of statistics to engineering, quality control 
and reliability. 

JOHN WINSOR PRATT, 28, has been Assistant Professor of Statistics at Harvard 
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University since 1957. He received his A.B. (1952) in mathematics from Princeton; and 
his Ph.D. (1956) in statistics from Stanford. He was research associate (1955-57) in 
statistics at the University of Chicago. His papers have appeared in the Annals of Mathe- 
matical Statistics, and his main area of interest is mathematical statistics. About this article 
he says, “When I was preparing to discuss how to handle 0’s and ties in Wilcoxon’s test 
in a course at Harvard, it occurred to me some curious things might happen that the 
literature didn’t seem to mention explicitly; I investigated, and found that indeed they 
could happen.” 

GEORGE POWELL STECK, 34, published au article in JASA in June 1959, and his 
biographical sketch appears in that issue. Since then his interests have become focused 
on random walks. 

ROBERT FLEMMING TATE, 38, has been Assistant Professor in the Department 
of Mathematics, University of Washington, since 1955. He received his B.A. (1944) in 
mathematics, and his Ph.D. (1952) in mathematical statistics, both from the University 
of California; and his M.S. (1949) in mathematical statistics from the University of 
North Carolina. He has been Lecturer at the University of California (1951-53) and In- 
structor at the University of Washington (1953-55). He wrote, “Unbiased estimation. 
Functions of location and scale parameters” for the Annals of Mathematical Statistics 
(June 1959); and “The theory of correlation between two continuous variables when one 
is dichotomized” for Biometrika (June 1955). His first article to appear in JASA was, 
“Applications of correlation models for biserial data” (Dec. 1955). His main field of in- 
terest is mathematical statistics (estimation). 

GORDON TULLOCK, 37, has been Research Fellow at the Thomas Jefferson 
Center for Studies in Political Economy of the University of Virginia, since 1958. He 
received his Doctor of Laws (1947) from the University of Chicago, and did post-doctoral 
work (1950-52) at Yale and Cornell. He was a foreign service officer specializing in China, 
Department of State (1947-56) and was doing writing or research until the Virginia ap- 
pointment. He wrote “Hyperinflation in China 1937-49” for the Journal of Political 
Economy (June 1954), and “Paper money, a cycle in Cathay” for the Economic History 
Review (June 1956). A book by him in the field of political theory is due to be published 
next year. His main fields of interest are law, economics, political theory, mathematics, 
and “in an amateur way, physics.” He also speaks Chinese. “The present article,” he says, 
“is the outcome of some work I was doing on the social organization of science. I had 
reached somewhat the same conclusions as Dr. Sterling but had not yet undertaken any 
empirical research when I saw his article. This saved me the work of looking up the data 
myself, but also left me with some thoughts which Sterling had not published. The 
present comment enshrines these remnants.” 

CHARLES DAVID WINDLE, 33, is Research Psychologist at Pacific State Hos- 
pital, an institution for mental defectives in Pomona, California. He received his B.A. 
(1949) from Oberiin, and his M.A. (1950) and Ph.D. (1952) from Columbia both in experi- 
mental psychology. From 1952 to 1955 he did military research for ONR at the State 
University of Iowa and for Human Resources Research Office in Washington, D. C. and 
Ft. Benning, Georgia. From 1955 to 1957 he was Supervisor of Personnel Research for the 
Iranian Oil Exploration and Producing Company, Masjid-i-Sulaiman, Iran. This is his 
first paper in JASA, but he has published previously in Psychological Bulletin, Educa- 
tional and Psychological Measurement, Journal of Experimental Psychology, Journal of 
Clinical Psychology, Journe! of Abnormal and Social Psychology, Journal of Personality, 
Journal of Consulting Psychology, and American Journal of Mental Deficiency. Currently 
he is engaged in research on the social factors affecting movements into and out of an 
institution for mental defectives. 
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Elementary Statistical Methods. Revised Edition. Helen M. Walker and Joseph Lev. 
New York: Henry Holt and Company, 1958. Pp. xvi, 302. $4.75. 


Henry F. Kaiser, University of Illinois 


Walker and Lev have done it again: this is a fine textbook. While outlining the 
general contents of the book, I shall endeavor to indicate that for its purpose, “. .. a 
one-semester introductory course” (most likely for beginning educationists or psy- 
chologists), Walker and Lev is about as good a text as may be found. 

Chap. 1 is a well-written but not atypical big-picture introduction. Beginning with 
some preliminary definitions in Chap. 2, the balance of the material through Chap. 4 is 
concerned with the details of techniques for gathering and recording data, construct- 
ing and using tables, and making graphs. Attractively and competently presented, 
these subjects are necessarily somewhat dreary, but probably serve well as a gentle 
warmup for beginning students. After discussing “tabulating and graphing data on a 
scaled variable” in Chap. 5, the notions of central tendency and variability are intro- 
duced early in Chap. 6 through the median and the range. 

Chap. 7, “Mean and standard deviation,” begins more important material. After 
defining the sample mean, the mean deviation, delightfully, is dispensed with in a 
short paragraph; the sample variance is introduced simply as s*= }>(X —X)?/(N —1) 
—no fussing with biased estimators allowed—and immediately the simplest of all 
computing formulas is presented—with no algebraic foolishness interposed. The 
sample standard deviation is then quietly defined as the positive square-root of the 
variance. A retrenchment—summarizing and expanding the treatment of frequency 
distributions—is the concern of Chap. 8. Particularly enchanting is the authors’ dis- 
position of the distinction between lepto-, meso-, and platykurtosis: “These terms 
are of no practical importance . . .” being “offered only for amusement.” 

Typical of this whole text, the exposition of bivariate linear regression in Chap. 9 
is particularly clear. This comment applies to the computational outlines, the treat- 
ment of conditional distributions, and the more difficult problem of explaining the 
regression paradox. Chap. 10 on correlation begins by explicitly distinguishing a cor- 
relation from a regression problem; this might have gone easier had Walker and Lev 
first distinguished a random from a sure variable. A number of formulas for the cor- 
relation coefficient are given—not to exhibit the agility of the authors at algebraic 
gymnastics—but for the eminently defensible reason of illustrating conceptually dif- 
ferent interpretations that may be given to the correlation coefficient .Chap. 11 is 
devoted to struggling through the quicksands of interpreting “. . . the size of corre- 
lation and regression coefficients.” This material might more appropriately be dealt 
with by a careful prior consideration of the populations involved. Presently this chapter 
seems to read like a manual for rationalizing—after the fact—unexpected correlation 
coefficients. The chapter concludes with a brief consideration of partial correlation 
—distinguished by an exact statement about the “correlation between residuals.” 

Chap. 12 considers the normal distribution. Walker and Lev first indicate that the 
major application of the normal distribution is in statistical inference (because of the 
central-limit theorem) and squelch the notion that parent populations are invariably 
normal. A detailed outline of how to operate with their tables of the normal distri- 
bution follows the introductory remarks. The normal distribution is then used for 
scaling scores in educational measurement. 

After 202 pages we come to Chap. 13, “Introduction to statistical inference.” The 
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class which gets this far and stops is being badly short-changed. Careful definitions 
of population and sample are given, the notion of “randomness” is clearly discussed 
—and the problem of obtaining random samples is effectively described. The most 
important distinction between a parameter and a statistic is forcefully made; this is 
in stark contrast to the usual book at this level for behavioral scientists. After this 
excellent presentation of the preliminaries, the balance of Chay. 13 is devoted to esti- 
mation. Point estimation is discussed in terms of two criteria. The notion of unbiased- 
ness provides the rationalization for dividing by (N —1) in the sample variance; the 
notion of efficiency provides a better basis for choosing a measure of central tendency 
than the usual hoary twaddle. Based on a well-developed explanation of the sampling 
distribution of the mean, the method for obtaining a confidence interval for the 
population mean is outlined—including a clear explanation of the subtle point of not ~ 
making a probability statement about a confidence interval. The ¢-distribution is 
mentioned in passing but, quite properly, not dawdled over. This Chap. 13, “Intro- 
duction to statistical inference,” is, in my opinion, a truly fine piece of pedagogical 
writing. I say this in the particular context of having typical beginning behavioral 
sciences students understand these most important of all ideas in statistics; it is in 
marked contrast to the anachronistic garbage (“Memorize the important formula for 
the standard error of the median kurtosis.”) often seen in popular competitive texts. 

I am a little less enthusiastic about Chap. 14, “Testing hypotheses” (about means). 
The early definition of level of significance seems a little too implicit for so central a 
notion. The distinction between one-sided and two-sided tests is hampered by re- 
stricting the null hypothesis to a specific hypothesis of no difference. This apparent 
devotion to a dogma of an earlier statistical era has led Walker and Lev to give a 
somewhat confusing (but not inaccurate) interpretation of the power of a one-sided 
test when the null hypothesis of no difference does not obtain, and does not obtain in 
the “wrong” direction. On the other hand, we have the truly avant garde situation of 
a student, after carefully reading this chapter, being quite able to compute the power 
of a test of a statistical hypothesis where the sampling distribution of the test-statistic 
is assumed to follow a normal distribution. 

Chap. 15 is concerned with “Inferences about proportions:” point estimates, inter- 
val estimates, and testing hypotheses, an ideal introduction to the intrinsically dif- 
ficult Chap. 3 of Walker and Lev’s excellent more advanced text, Statistical Inference. 
The final Chap. 16, “Inferences about correlation coefficients,” disposes of the stand- 
ard error of a correlation coefficient in one short paragraph, gives a test for the null 
hypothesis that the population correlation coefficient is equal to zero, and more gen- 
erally, an outline of the usefulness of Fisher’s z for tests of hypotheses about and 
interval estimates for population correlation coefficients. The book closes with a dis- 
cussion, in the light of this chapter, of an interpretation of the “size of a correlation 
coefficient”—an improvement over the material covered in Chap. 11. 

Following are 18 pages of tables (normal distribution, square-roots, squares, t- 
distributions, distribution of the sample correlation coefficient, Fisher’s z, ranks to 
standard scores, random numbers, binomial distributions), a glossary of symbols 
used, a 6-page list of all formulas, answers to all problems, and a carefully made 6-page 
index. 

One overriding criterion would seem to have dictated the decisions made in writ- 
ing the book. That is a careful consideration of the problems of ordinary students at- 
tempting to learn statistics. Commercial appeal, impressing other statisticians, and 
the like, are all secondary to the goal of teaching students a great deal of worthwhile 
statistics as effectively as possible. On every page, it would seem, there are indications 
of master teachers at work. 
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Multivariate Correlational Analysis. Philip R. DuBois. New York: Harper & Brothers, 
1957. Pp. xv, 202. $4.50. 


Paut 8S. Dwyer, University of Michigan 


This book is devoted to a systematic interpretation and treatment of multiple 
and partial correlation formulas and methods. It emphasizes the concept of partial 
covariance as the basic integrating concept in this area of multiple regression. This 
concept is also closely related to the main feature of the recommended computation 
method which contains both abbreviated and non-abbreviated versions. Then suc- 
cessive topics and techniques are developed and provide brief (averaging about 11 
pages) explanations of those topics which the author thinks are important to his 
presentation. There is a reference list of 36 titles, three appendices containing numeri- 
cal tables and a “multivariate analysis chart,” and a glossary of terms followed by an 
index of names and an index of subjects. 

It should be made clear first that this book is not devoted to the general area of 
multivariate analysis as the term is generally understood by statisticians so that the 
table of contents has little in common with that of a book such as T. W. Anderson’s 
Introduction to Multivariate Statistical Analysis. Though the author does use the term 
“multivariate analysis” (see p. 122), it is clear from the treatment, and from the title 
of the book itself, that he is giving primary consideration to problems associated with 
multivariate linear correlation. It should be made clear further that the treatment is 
for the most part directed toward the problem of interpretation and computation 
and not to problems of sampling though there is some consideration given to these 
problems in Sec. 14 and occasionally (see p. 96 and p. 116) elsewhere. 

The more formal treatment of Sec. 13 is devoted to the basic concepts of multi- 
variate linear correlation including standard regression coefficients, multiple corre- 
lations, partial r’s, part correlations, and multiple partial r’s. These are all worked 
out in terms of the basic partial variances and partial covariances of different orders. 
There are recursion formulas for standard regression coefficients which the author 
proposes as alternatives to those of Yule. Equivalence to those of Yule is illustrated. 
There is no mention of sampling problems in Sec. 13. 

The partial covariance between two criterion variables is the average value of the 
products of the residuals resulting from the use of the same set of n predictors. This 
average is formed by dividing by N (the number of pairs of residuals) rather than by 
the number of degrees of freedom. The definition of variance of residuals for a single 
criterion also calls for the division of the sum of the squares of the residuals by N. 
The variances and covariances are the mean squares of standardized residuals. 

A recognition of the sampling problem appears in Sec. 14. We are first led to 
“methods of inferring multivariate relationships as they would be if one or more of 
the conditions in the sample were changed.” The treatment leads to such concepts 
as partialling variance into the elements of a matrix, a correction for attenuation, 
correcting r’s to any hypothetical reliability, and estimation of correlations missing 
from a matrix. The author then presents a formula for the shrinkage of the multiple 
(Wherry) and a formula for the reliability of a residual. A final page, devoted to the 
sampling distributions of multivariate r’s, contains references to Yule, Fisher, Ken- 
dall, and McNemar. 

The final Sec. 15 discusses multivariate analysis in social science research. There is 
reference to multivariate curvilinear analysis and a discussion of experimental tech- 
niques designed to produce predictors which are uncorrelated. The author says the 
basic theorem in multivariate analysis is: 


Every value of a variable may be divided uniquely into two uncorrelated com- 
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ponents: a portion perfectly correlated with an outside variate and a portion uncor- 
related with the outside variate. 


He also discusses very briefly such topics as true variance and error variance, the 
reduction of the number of variates, closed systems of varie ‘es, and the construction 
of variates. 

In addition to the treatment of multivariate linear regression, the author has pre- 
pared two sections (11 and 12) on factor analysis. The author does not claim that the 
material in these sections constitutes a systematic presentation of factor analysis. 
He has attempted to demonstrate, appropriate to the objective of his book, relation- 
ships between factor analysis and the concepts of regression analysis, especially 
partial correlation. 

One question of notation should be considered. The derivation of multivariate 
regression formulas brings out an inconsistency between the mathematical matrix 
notation (in which the first subscript indicates the row and the second the column) 
and the Yule regression notation in which the first primary subscript indicates the 
predicted variable and the second primary subscript the predicting variable whose 
coefficient is being indicated. The author indicated that he has ignored (p. 132) the 
mathematical convention on order of subscripts in order to maintain the Yule nota- 
tion. The author of this review has advocated for some time that the conventional 
mathematical notation should be used and that the resulting notation for a regression 
coefficient should feature first the predictor variable, then the predicted variable, and 
finally the other predictor variables since the more precise notation is then obtained 
by adding symbols, without rearrangement, to the less precise notation which features 
only the specific predictor variable. 

In the opinion of the reviewer, this book contains material which is good for those 
for whom the presentation is designed. It is also his opinion that, once the nature of 
the successive partial covariance matrices to the basic concepts of multiple regression 
is understood, alternative computational forms can be used as effectively. I think that 
the author would agree that the student of general multivariate analysis should be 
aware of the available treatments of many topics not covered by this book. 


Sampling Opinions. Frederick F. Stephan and Philip J. McCarthy. New York: John 
Wiley and Sons, 1958. Pp. ix, 451. 


EvLeanor E. Maccosy, Stanford University 


This book is intended to serve several purposes. It deals with the practical and 
some of the theoretical aspects of sampling for interview surveys, and attempts to do 
so in such a way as to be understood by non-technical readers. At the same time it is 
a report to the sample-survey-taking profession of the findings of a detailed study of 
current sampling practices, as they are found in the major survey organizations of the 
nation. The study reported herein was sponsored jointly by the National Research 
Council and the Social Science Research Council; this is, then, a companion volume 
to Herbert Hyman’s earlier book on interviewing.' 

The book is divided into three sections. Part I, “The nature and role of sampling,” 
is a highly simplified general discussion covering such topics as a definition of “repre- 
sentativeness” and a description of the different varieties of sampling in current use. 
Part II, “Empirical studies,” presents the findings of the authors’ audit of current 
sampling practices. Part III, “Design of sample surveys,” discusses sampling in the 
context of the entire survey process, showing, for example, how the choice of sampling 
method is related to such factors as the subject-matter of the study and the inter- 


1 Hyman, Herbert, Interviewing in Social Research. Chicago: University of Chicago Press, 1954. 
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viewing method to be employed. The text of Parts I and III uses “homey” language 
wherever possible (e.g., p. 72, “an active sampler goes out to get a sample in much the 
same way as a housewife takes her list and basket to go shopping”). While one can 
only sympathize with the authors’ objective of communicating with business people 
and other potential users of surveys who do not have statistical training, this re- 
viewer does not share the estimate of its audience’s intelligence which seems to be im- 
plied by the book’s level of writing. For example, on p. 61 we find: 


Group measurements are derived from individual measurements, but they cannot be 
obtained simply by measuring only one individual. There may be no possibility of 
finding an individual whose measurements are exactly the same as those of the 
population. Thus there is no individual who is 55% Republican in the same sense 
that the voting population may be. 


For the readers of the Journal of the American Statistical Association and for teach- 
ers of graduate courses in research methods, the material of interest will largely be 
found in Part II. This section is a well-organized (and more technically written) 
presentation of evidence on the performance of several varieties of quota and proba- 
bility samples, including an excellent summary of the available methods for comput- 
ing the variances of these samples. Sometimes the authors draw on previously pub- 
lished material, sometimes on unpublished reports which survey organizations have 
prepared for internal use, and sometimes upon new data gathered by the authors 
themselves. Notable in the latter category is their analysis of the field operations of a 
national quota sample. They asked interviewers on a survey conducted by National 
Opinion Research Center to keep a diary, describing exactly their steps in selecting a 
respondent—the number of “not at home’s,” refusals, and persons approached who 
did not fit the quota categories, as well as the time spent by the interviewers in the 
search for respondents. For a number of years we have had a good deal of information 
concerning the effects upon probability samples of failure to interview a proportion 
of the originally designated sample; the study reported here gives us comparable in- 
formation on a quota sample. The authors conscientiously do not take sides between 
probability and quota samples, pointing out that each can be biased to varying and 
unpredictable degrees. But partisans of either kind of sample can find support for 
their preference in these findings. Quota samplers will be pleased by the evidence that 
a quota sample can be very well-balanced within sub-categories (e.g., age well dis- 
tributed within social-class groupings) as well as across the whole sample, even when 
interviewers are not required to maintain within-group balance. They will also be 
pleased to note that the biases which arise from refusals and not-at-home’s tend to 
affect demographic characteristics and information questions more than they do 
opinion questions. But opponents of the quota method will find support for their 
contentions in the fact that quota interviewers made approximately two attempts to 
obtain an interview for every interview completed; that is, half the attempts were 
unsuccessful. This points to a danger of considerable bias from the omission of rela- 
tively inaccessible and reluctant respondents who are shown elsewhere in the book to 
differ in certain characteristics (including political preferences) from the rest of the 
population. 

Where technical issues are discussed, the positions taken seem sound, with perhaps 
a single exception. The authors say that in multi-stage probability sampling, when 
primary sampling units are unequal in size, the sample will be biased. It is clear 
enough that when both the proportion of respondents having a given characteristic 
and the number of cases that will fall into the sample are subject to sampling varia- 
bility, the variability of the resulting estimate is greater and the process of estimating 
the variability is more complex. But it is not obvious (at least to this reviewer) that 
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dias will result from having sampling error in both the numerator and denominator 
of a proportion. 

In sum, the level of this book is highly uneven, but it contains some valuable 
material for teachers of survey methods or those professionally engaged in sampling 
operations. 


A Critique of the United States Income and Product Accounts. Studies in Income and 
Wealth, Volume 22. Conference on Research in Income and Wealth. A report on the Na- 
tional Bureau of Economic Research. Princeton, New Jersey: Princeton University Press, 
1958. Pp. x, 589. $11.50. 


Kennetu E. Bovutpina, University of Michigan 


This noble volume is a reviewer’s despair. It is impossible to summarize the work 
of twenty-four authors and the lifework of at least one of them, and there is no 
substitute for the richness of content and the many closely reasoned arguments of 
the book itself. For a dull-sounding work on economic statistics the book is surpris- 
ingly dramatic in form; at times it has almost the quality of a trial or even an inquisi- 
tion. Its hero is George Jaszi, and the defendant is the National Income and Product 
Accounts of the United States, as produced by the National Income Division of the 
Department of Commerce. Jaszi might well be called the papa of the National Income 
accounts—many progenitors may have sired them, but Jaszi raised them from child- 
hood. This casts a certain papal unction over the volume, which has in places the air 
of a conclave of statistical cardinals in strong but respectful debate with the Pope. 
The volume opens with a monograph length re-examination of the conceptual basis 
of the accounts by Jaszi himself, in which is distilled the wisdom and the doubts of 
many years of experience in the game of National Income Accounting. The “income 
side” is then discussed by R. T. Bowman, R. A. Easterlin, Morris Cohen and M. R. 
Gainsbrough, and the “product side” by E. E. Hagen, E. C. Budd, and K. D. Ross. 
T. C. Schelling discusses the design of the accounts in a crisp and incisive paper; 
D. W. Lusher makes a brief, and Morris Copeland a more extended, plea for more 
flow data; E. C. Budd discusses the distributive shares; V. L. Bassie appraises the 
uses of the data for short term analysis; Paul Kircher surveys the business uses of the 
data; J. W. Kendrick discusses the measurement of real product. Then follow seven 
papers on special segments of the accounts—Eric Schiff on “Business Plant and 
Equipment,” Solomon Fabricant on “Capital Consumption and Formation,” R. W. 
Goldsmith on “Saving,” Karl A. Fox on “Agricultural Income,” Stanley Lebergott 
on “Entrepreneurial Income,” Joseph Lerner on “Extractive Industries,” and Clark 
Warburton on “Financial Intermediareis.” This enormous menu is topped off by a 
paper by Geoffrey H. Moore on “Seasonal Adjustments.” To all these papers Jaszi 
makes some reply, and the participants indulge in the crossfire which has become so 
engaging a feature of these conferences. 

On the whole it is the view of this reviewer that within the framework in which 
they are set the Accounts stand up very well to criticism, and that out of these 
many skirmishes Jaszi emerges as an impressive figure: his very frankness, honesty, 
and willingness to learn disarms criticism. National income and product accounts are 
a strange structure: they start from a pretty solid statistical basement of census and 
tax returns and end in cloud castles of estimation. Nobody quite knows where they 
should begin and end; there are always good arguments for adding something to or 
subtracting something from the aggregates. Their construction is almost a matter of 
the aesthetic judgment; there is no clearly “right” way of doing it. Nevertheless they 
make a certain amount of sense and are enormously useful in outlining the major 
dimensions of the economy. This is an area perhaps where we should not worry about 
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too great refinement—a porch here and an attic there, a room subdivided here or 
enlarged there would all be nice but not necessary, and meanwhile there are always 
very good arguments for keeping things as they are. There are, it is true, some major 
grumbles: nearly everybody would like to see greater refinement in the government 
accounts, nobody is quite happy with the depreciation accounts, and those rare souls 
who understand them are even unhappier about the depletion accounts. The treat- 
ment of financial institutions is odd to say the least of it. Still, wherever things are 
bad it is very hard to spell out anything that is clearly better, and on the whole the 
accounts stand up as an enduring compromise between the ideal and the attainable. 

After such a rich feast it is a little ungracious to complain about what was not on 
the menu. However, it would have been interesting to have a paper on the actual 
routine and operations of the National Income Division; with all the volumes of 
explanation of concept the outsider still does not have a clear picture of the actual 
operations of the office. Then what seems to me the major deficiency in the accounts 
is nowhere mentioned in the whole volume—the almost total absence of geographical 
breakdown. One constantly recurring theme in the volume is the desirability as far as 
possible of having the accounts invariant with respect of institutional changes—at 
least up to a point. The famous criticism that when a man marries his housekeeper 
the national income falls is an example of the failure of this invariance at the level 
of the household. Changes in the frontier of the nation, however, are a much more 
fundamental institutional shift: we are certainly not going to want to eliminate the 
figure for the national product or income of the continental United States just be- 
cause we have added two more stars to the flag. Statisticians generally seem to have 
a positive hatred of geography, yet for many purposes the distribution of certain 
variables like income over the face of the globe is the really interesting phenomenon, 
rather than any arbitrary collection of aggregates related to certain areas. One hank- 


ers for a marriage of statistics and geography (it is fantastic, for instance, how frag- 
mentary the information is even in price geography, and income geography is no bet- 
ter). This, however, is no criticism of the present volume, but of the state of the art. 
To all specialists in national income statistics this volume is a must, and even many 
non-specialists will find it of interest. 


Recent Trends in Fertility in Industrialized Countries. Population Studies No. 27- 
United Nations, Department of Economics and Social Affairs. New York: Columbia Uni- 
versity Press, 1958. Pp. xi, 182. Paper, $2.00. 


Witson H. Bureau of the Census* 


This report of the United Nations stems from the interest of their Population Com- 
mission in the “baby boom” and in whether this has altered the long-term prospects 
for future births and population change. International comparisons often reveal more 
about a subject than can be obtained from one-country studies; the present report is 
no exception. 

The report gives basic data and a valuable analysis of the recent birth rates in 
twenty selected countries in the European cultural sphere: Australia, Austria, Bel- 
gium, Canada, Czechoslovakia, Denmark, Finland, France, Great Britain (England 
and Wales, Scotland), Ireland, Italy, The Netherlands, New Zealand, Norway, Por- 
tugal, Spain, Sweden, Switzerland, and the United States of America. These countries 
are selected largely on the basis of the availability of reliable detailed statistics, but 
Japan is excluded as not being in the European cultural sphere. Not all of the types of 
data shown are available for all of the countries. 

The analysis is intentionally limited to demographic aspects of fertility, as the 


* The statements made herein do not necessarily reflect the views of the U. S. Bureau of the Census. 
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official reports of the countries are the main sources of data. The few unofficial studies 
of the effect of changing social, economic, and political conditions on fertility are given 
little mention. 

A brief account is given of the pitfalls and cautions to be observed, although rough 
international comparisons of data can be made. For example, some countries count 
as births only those babies who are living at the time of registration of the birth; some 
countries count all babies who lived for even an instant after birth; and some data are 
on confinements of women, regardless of whether the outcome is a stillbirth, a single 
live birth, or > » "ural birth. The completeness of birth registration and the quality of 
the populati .n bw ses for birth rates vary. 

The tren’ 's of the crude birth rates are examined for as many years in the period 
from 1880 t. 1954 as data for each country permit. All countries-shown have had 
long-term dec: es in fertility, but the magnitudes of the declines have differed. All 
have had large increases during and since World War II. In countries where the birth 
rate was at a very low level in the 1930’s, the usual trend was stationary between 
1934 and the outbreak of World War II. Thus, the long-term decline stopped for at 
least two decades in many industrialized countries. In other countries, the birth rate 
was high in the 1930’s but declined until the outbreak of World War II. In most 
countries the birth rate reached a peak about 1946-47 and then receded by varying 
amounts. In most European countries, the birth rate in 1954 was only a little above 
the prewar level. In Canada, New Zealand, the United States, and Australia the birth 
rate in 1954 was far above the prewar level. 

Successive chapters explore in some depth the increase in the birth rate and its 
implications for future trends. Trends in annual data on births classified by age of 
mother, by duration of mother’s marriage, or by order of the birth (mother’s first 
child or other specified number) show that the incrcase in birth rates in the World 
War II and postwar period came about from changes in the timing of births. Thus, 
there were relativeiy more young mothers, more mothers reporting a recent marriage, 
and more mothers reporting that the child was their first or second child. The impli- 
cations of such data for an eventual increase in the average size of families are dis- 
cussed, with inconclusive results. The changes in the age and sex composition of the 
population are shown to have generally impeded the rise in the crude birth rate dur- 
irg the postwar period. Virtually all countries reporting had a larger proportion of 
married persons of young age and a lower medicn age at first marriage. 

Census-type data on the number of children ever born to existing marriages are 
compared for various countries. The census data are used to study trends in fertility 
by specified marriage durations, by age at marriage, and by successive marriage co- 
horts. The census data are available for only a few dates but are used to reconstruct 
past trends. A similar analysis is made of birth registration data cumulated over 
periods of time for specified marriage cohorts. The annual vital statistics are also 
cumulated over periods of time for specified birth cohorts of women and compared 
to show changes in the timing of births among successive cohorts. The point is prop- 
erly made that in the future the trends in fertility will be increasingly sensitive to 
changes in social and economic conditions, which can seldom be foreseen. Hence, the 
projection of trends in fertility is still a speculative task, notwithstanding the possi- 
bility that the long-term decline of fertility has come to an end in some countries. 

A bibliography of selected official reports is given, by countries, and an appendix 
presents detailed data used in the report. There is no index. 

The report would have been strengthened if it had stressed at an early point Hajnal’s 
admonition that it is probably fruitless to search for the “right” method of measuring 
fertility. The relative merits of analyzing birth registration data by conventional 
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methods or by cumulations into birth and marriage cchorts, of using vital statistics 
or census data, of using birth cohorts or marriage cohorts, etc., are stated with special 
objectives in mind. Different analysts from different countries have different objec- 
tives. Thus, statements of justification for showing data in one form rather than 
another occasionally conflict. 

Some countries collect census data on the fertility of existing marriages only and 
some collect data for all women or all women who have ever been married. The only 
chapter that makes use of census data is limited to the study of the fertility of existing 
marriages. This reviewer would have liked additional material on reproduction losses 
arising from the failure of some people to marry and have children or from the inter- 
ruption of marriages before the end of the reproductive period. In low fertility areas, 
the replacement of population may depend on the extent to which the fertility of 
existing marriages compensates for the infertility which arises from such sources. 
Also, census data that are limited to marriages existing on the census date are not 
entirely representative of all marriages. Thus, the average age at first marriage tends 
to be slightly older for couples who are still married at the census date than for per- 
sons whose first marriage has been broken by widowhood, separation, or divorce. 

The minor limitations of this report detract only in a negligible way from its use- 
fulness as a source of varied data and methodology, and the analysis is valuable and 
competently done. 


Output, Labour and Capital in the Canadian Economy. William C. Hood and Anthony 
Scott. Ottawa: Royal Commission on Canada’s Economic Prospects, Fe! suary, 1957. 
Pp. 513. No price listed. 


Harry E. McAuuister, National Bureau of Economic Research and 
Washington State University 


In the authors’ words, the task assigned them by the Royal Commission “. . . was 
to forecast (for Canada) the ranges within which the measures of population, labor 
force, output, expenditure, capital, and certain of their constituent elements might 
be expected to lie.” These forecasts were made for five-year periods beginning with 
1960 for population, 1965 for the other variables, and ending with 1980. 

Chap. 2, which begins the basic work of the book, is concerned with identifying 
the elements of economic growth believed essential to the Canadian forecasts. 
Utilizing data and theory from American, British, and Canadian sources, growth in 
the United States, the United Kingdom, and Canada is analyzed. The authors, at 
the end of the chapter, set forth what they believe to be the key ratios in the analysis 
of economic growth, and classify them as constant ratios, growing ratios, and ratios 
whose movements are uncertain. 

The constant ratios include: “(a) an approximately fixed relationship between 
saving and output, (b) a related constancy of the ratio of capital to output, (c) a 
significant stability of the ratio of labor income to national income, and (d) a limited 
range of variation among the rates of growth (especially of output or income per head) 
of mature economies or of the regions of an economy.” 

Growing ratios set forth are: “(a) output per worker and output per man-hour of 
work, (b) income per unit of work; that is, wage rates, (c) the ratio of capital stock 
to labor input, (d) the ratio of leisure to other types of consumption, and (e) the ratio 
of output to energy consumed.” 

The third set of ratios, the movements of which tend to be uncertain, are: “(a) the 
proportion of the population that is in the labor force, (b) the proportion of the 
national expenditure that is imported, and (c) the proportion of the national output 
that is exported.” 
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In Chap. 3 the theory of economic growth believed relevant to the forecasts is pre- 
sented. The determinants of growth are stated to be the “spirit of the people,” 
“economic institutions,” “the knowledge of techniques,” and a less important factor, 
the population of the economy. From a statement of these determinants, the chapter 
then turns to an examination of three mathematical models of the economy. These 
models, all of the Cobb-Douglas variety, are analyzed to determine the economic 
implications of each. 

Chap. 4, a key chapter of the study presents forecasts of the magnitude of the 
population and the labor force. Forecasting methods are carefully set forth so that 
assumptions can be examined as the forecast is being derived. Three alternative as- 
sumptions were made for the magnitude of net flow of imigrants, and three resulting 
forecasts were obtained for both population and labor force. Regional distribution 
of population and Jabor force was made by province groupings. 

Chap. 5 describes the process and gives estimates for Gross Domestic and Gross 
National Product. On p. 194, an excellent schematic diagram of the process used in 
attaining these variables is presented. In brief, the forecasting procedure develops as 
follows: The labor force estimates of Chap. 4 are used as a basis to derive estimates 
of the empioyed labor force. These estimates assume full employment at a level of 
97 per cent of the labor force. A division is then made of the employed labor force 
_ among three main sectors of the economy: (1) government and community service, 
(2) agriculture, and (3) the business sector. The output that could be expected on 
the average from each man employed in each sector is estimated in 1949 dollars. A 
simple multiplication by the number of men employed is carried out for each sector 
to yield GDP for each of the three sectors. The three elements of GDP are then totaled 
to give total GDP at factor cost, excluding rents. The inclusion of rents yields GDP 
at factor cost, and addition of indirect taxes gives the sum of GDP at market prices. 
To this aggregate interest, net (domestic minus foreign) dividends and similar pay- 
ments are added to attain the final aggregate of GNP. 

The methods used to estimate output in each of the three sectors are discussed, as 
well as the adjustments involving indirect taxes and interest and dividends. More 
influence is given to the recent growth rates, and alternative forecasts are made in 
the business sector assuming different alternative compounded average rates of 
growth. 

In the government and community sector estimate, the forecast assumes that out- 
put per man will remain constant on the basis that the statistical record of the past 
showed this phenomenon to exist. 

The “guess” concerning payments of interest and dividends abroad is one of the 
few estimates for which no foundation is advanced. The reader is warned that it is a 
guess but the basis is not made explicit. 

Chap. 6 sets forth the record of “fixed capital” (gross domestic investment for new 
nonresidential construction and new machinery and equipment) for Canada, and the 
method of forecasting its accumulation in the future. Reasons for choosing the cu- 
mulation methods in preference to the capitalization or the adding of assets method 
are stated. In using the cumulation method, it has been assumed that the lifetime 
of an asset can be stated as a period of years, rather than using distributions of life 
expectancy. Also, because the authors work with data that classify investment ex- 
penditures by industry, they felt it necessary to assume that the service lives of all 
assets in a given industry can be averaged into one representative figure, and also 
that the annual expenditures by a given industry contains, from one time period to 
another, unchanging proportions of each kind of asset. 
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In Part IV of the chapter, the authors explore the relationship between their esti- 
mates of capital, and the concept of capital as ordinarily used in economic theory. 
They conclude that the environmental situation is such that an analyst must abandon 
many of the precepts of capital theory if he is to estimate capital magnitudes. 

They also consider the capital-output ratio and its change over time, the relative 
importance of machinery and equipment, and some of the questions involved in the 
analysis of depreciation. In considering the capital-output ratio, they conclude that 
there are five important forces at work upon it that apparently work with enough 
counterbalancing effects over time such that the ratio tends to remain stable. They 
conclude this not only for Canada but also for other countries. Their analysis of the 
relative importance of machinery and equipment leads them to conclude that equip- 
ment has been becoming more important in the total of fixed capital since the 1920’s 
for both the United States and Canada. 

In the last chapter, components of the aggregates previously forecasted are reported 
upon. The distribution of the civilian employed labor force and the industrial distri- 
bution of Gross Domestic Product are shown as averages for 1979-81. Estimates of 
gross national expenditure, saving, and investment are similarly reported. 

Some statistical errors are always inevitable in a book that presents as much in- 
formation as this one does. A number of these have already been discovered by the 
authors, and an errata sheet is distributed with the book. In addition to these, there 
are some others. A number are present in Table 6.1 on p. 237. Errors are always un- 
fortunate, and especially so when data are used to explain a procedure as they are 
in this table. The 1930 figure for col. 2 should be 3.3 instead of 3.5. Also, a number 
of the figures in col. 7 do not result from cumulating the given col. 6. These errors 
could have arisen from an incomplete change from previous rounding. One more item 
with regard to this table: the formula heading column 6 is incorrect; it should be cor- 
rectly expressed as J, —(1/L)G@n1. 

In appraising a book, one can measure what was done in terms of what was at- 
tempted. This book attempted to forecast population, labor force, output, expendi- 
ture, capital, and some of the component elements of these aggregates for Canada. 
These forecasts were made within the framework of certain theoretical assumptions 
which were made explicit in almost every instance. In appraisal, after asking “Were 
the forecasts necessary?” and receiving “Yes” as an answer, it is necessary to ask if 
they followed proper assumptions with appropriate statistical data and techniques 
while being cast into a proper framework of economic theory. The answers to these 
questions are again “Yes,” although the discussion in Chap. 3 of the determinants of 
economic growth tends to be somewhat sterile because it is never related in any 
meaningful way to the forecasting model. 

The authors have many debts which they freely acknowledge. They have drawn 
on a rich reservoir of statistical data, techniques, and theory, and have utilized 
them to advantage in setting forth their forecasts. While they have developed a vast 
amount of new statistical data about economic aggregates for Canada, they have ac- 
complished more than this. They have given economists a large amount of material 
with which to theorize and have suggested an appropriate framework within which 
this theorizing might be fruitful. As a by-product, they have to a degree enriched the 
theory of economic growth and statistical analysis by their findings, their techniques, 
and by their frank admissions about problems and limitations of the analyses and 
forecasts. Students of the aggregate economic magnitudes and growth will find this a 
valuable book. 
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Operational Research in Practice. Mar Davies and Michel Verhulst, Eds. New York: 
Pergamon Press, Inc., 1958. Pp. ix, 201. $12.00. 


Samuet H. Brooks, General Analysis Corporation 


In this book are papers and discussions presented at a NATO Conference in 1957 
to interest executives in operational research and to demonstrate specific applications 
of the latest techniques. Topics included are the status of and organization for OR 
in the member nations, several World War II and contemporary military applications 
of OR, and a few rather philosophical discussions of OR. A rich diversity of opinion— 
inherent in almost any international conference—is presented as to what OR is, how 
it got started—Sir Robert Watson-Watt or Archimedes—and where it will lead. 
Statistics, as such, is scarcely mentioned although it is one of the more fundamental 
techniques of OR. However, this is not the only technical aspect that gets scant treat- 
ment, since the papers are mainly about OR rather than on OR. The paper “Scientific 
Aids to Decision Making—A Perspective” by J. R. Goldstein, is an excellent exposi- 
tion of methods of approach to OR problems. In the discussion a general points out 
that this is well covered in the “Staff Officers’ Field Manual.” Another paper which 
I found to be outstanding is “Future Fields for Operational Research in the NATO 
Nations” by H. A. Sargeaunt. OR, he indicates, should be done by first class scientists 
putting their best efforts on novel and important problems and that emphasis should 
be given to developing it as a discipline in its own right. A more extensive review by 
R. H. Colleutt, British Iron and Steel Research Association can be found in Operations 
Research, Volume 7 (1959), 132-3. 


Punched Cards: Their Application to Science and Industry, Second Edition. Robert S. 
Casey, James W. Perry, Madeline M. Berry and Allen Kent, Eds. New York: Reinhold 
Publishing Corp., 1958. Pp. x, 697. $15.00. 


C. L. Perry, Stanford Research Institute 


The design of information storage and retrieval systems becomes increasingly im- 
portant as the already large amount of technical literature expands. About half of 
the thirty chapters of this book are concerned with the use of edge-punched cards 
(i.e., cards with holes, notches and slots punched along one or more sides of the card) 
for locating information. The book is also designed to be used as an operation manual 
for edge-punched card systems. Other applications such as the use of punched cards 
in computations, accounting, and control applications are briefly mentioned or 
overlooked. 

The second edition of Punched Cards is a substantially reworked and enlarged 
version of the previously reviewed first edition [c.f., review by H. P. Hartkemeier, 
this journal, 48, (1953), 372-4]. The first edition contained a chapter on the use of 
punched cards in computation. This chapter, which contained some misleading sta- 
tistical statements (c.f., previous review), is not included in the second edition. The 
second edition was written by 32 authors, 12 of whom also contributed to the first 
edition. As in the first edition little attempt has been made to interrelate the contribu- 
tions of the various authors. 

The low equipment cost for punching and sorting edge-punched cards (e.g., Key- 
sort, E-Z Sort and Rocket Cards) makes them particularly useful for locating infor- 
mation in files of modest size and in applications such as the classification of observa- 
tions. The cards may be notched with a manual punch and sorted with a long needle. 

The central problem in the design of storage and retrieval systems is the develop- 
ment of a system for classifying the information. In this book classification systems 
that reflect the inherent taxonomy of the subject are described for anesthesia, 
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chemistry, genetics, metallurgy, and refinery reports. About half of the book is con- 
cerned with these practical applications. Other topics discussed include: a mathe- 
matical description of coding schemes; very brief descriptions of IBM and Remington 
Rand card processing machines (sorter, collator, tabulator and calculating punch); 
literature organization and search; linguistic analysis. 

One interesting suggestion of interest to data analysts is the use of edge-punched 
cards to locate interrelations between variables. Consider the values of the variables 
for an observation to be indicated by notches in the edges of the cards, each variable 
being assigned a given segment of the edges. The cards may be sorted so the values 
for any given variable are in ascending order. An indication of possible functional 
relations between this variable and any one of the other variables may be examined 
visually by observing the structure of the notch formations in the appropriate edge 
sector. 

The book is concluded with a short chapter on data processing machines and an 
annotated bibliography on uses of punched cards containing 400 references. 

The book should be most useful to scientists and information specialists interested 
in developing a low cost mechanized information retrieval system for a modest file. 


Mill Test Procedures. N. L. Enrick. New York: Modern Textiles Magazine and Rayon 
Publishing Corporation, 1956-58. Pp. 53. Paper. $3.50. 


L. H. C. Tippert, British Cotton Industry Research Association 


This publication is addressed to textile technologists and gives routine test pro- 
cedures for use in textile spinning, weaving and knitting mills. Included are sampling 
instructions, frequencies with which tests should be made, and forms on which results 
can conveniently be recorded; and there is some advice on the evaluation of results. 
Statistical methods are clearly at the root of many of the procedures, but these are 
not described. The publication should be very useful to mill men. 
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stitute of Technology, 1959. $3.00. Paper. 

Bassie, V. Lewis. Economic Forecasting. 
New York, New York: McGraw-Hill 
Book Company, Inc., 1958. $8.75. 

Bjerve, P. J. Planning in Norway 1947- 
1956. Amsterdam, Holland: North- 


Holland Publishing Company, 1959. 
$9.25 


Blom, Gunnar. Statistical Estimates and 
Transformed Beta- Variables. New York, 
New York: John Wiley and Sons, Inc., 
1959. $5.00. 

Board of Governors of the Federal Reserve 
System, Division of Research and Sta- 
tistics. All-Bank Statistics— United 
States 1896-1955. Washington, D. C.: 
Federal Reserve System, 1959. Price not 
marked. Paper. 

Boudreau, Frank G., Kiser, Clyde V. (Edi- 
tors). Thirty Years of Research in Hu- 
man Fertility, Retrospect and Prospect. 
New York, New York: Milbank Me- 
morial Fund, 1959. $1.00. Paper. 

Bowker, Albert H., Lieberman, Gerald J. 
Engineering Statistics. New York, New 
York: Prentice-Hall, Inc., 1959. $11.00. 

Bowman, Edward H., Fetter, Robert B. 
(Editors). Analysis of Industrial Opera- 
tions. Homewood, Illinois: Richard D. 
Irwin, Inc., 1959. $7.95. 

Bowman, May Jean (Editor). Ezxpecta- 
tions, Uncertatnity, and Business Be- 
havior. New York, New York: Social 
Science Research Council, 1958. $2.00. 
Paper. 

Brazer, Harvey E. City Expenditure in the 
United States (Occasional Paper 66). 
New York, New York: National Bureau 
of Economic Research, 1959. $1.50. Pa- 


per. 

Brems, Hans. Output, Employment, Capital, 
and Growth. New York, New York: 
Harper and Brothers, 1959. $6.00. 

Bureau of Business Research, College of 
Business Administration, Boston Uni- 
versity. New England Economic Indi- 
cators. Boston, Massachusetts: Bureau of 
Business Research, Boston University 
College of Business Administration, 1957. 
Price not marked. Paper. 


Cairncross, Alec. The International Bank 
for Reconstruction and Development, Es- 
says in International Finance, No. 33. 
Princeton, New Jersey: International 
Finance Section, Princeton University, 
1959. Price not marked. Paper. 

Chalfant, William Bergen. Pris er of Free 
Government. New York, New York: 
Philosophical Library, Inc., 1959. $3.00. 

Committee on Economic Growth. The 
State and Economic Growth. New York, 
New York: Social Science Research 
Council, 1959. $3.75. 

Connor, W. S., Zelen, Marvin Fractional 
Factorial Experiment Designs for Factors 
at Three Levels. Washington 25, D. C.: 
National Bureau of Standards, 1959. 
$.30. Paper. 

Conselho Nacional de Estatfstica. Anudrio 
Estatistico De Brasil—1958, Ano XIX. 
Rio de Janeiro, Brasil: Ibge-Conselho 
Nacional de Estatistica, 1958. Price not 
marked. Paper. 

Cowles Foundation. Report of Research Ac- 
tivities, July 1, 1956-June 30, 1958. New 
Haven, Connecticut: Cowles Foundation 
for Research in Economics at Yale Uni- 
versity, 1959. Price not marked. Paper. 

Cox, Oliver C. Foundations ‘ Capitalism. 
New York, New York: Philosophical 
Library, Inc., 1959. $7.50. 

Davis, Thomas Jeff. Cycles and Trends in 
Textiles. Washington 25, D.C.: U. S. 
Department of Commerce, 1958. $.40. 
Paper. 

Dayhaw, Lawrence T., Manuel De Stia- 
tristique. Ottawa, Canada: Editions de 
l'Universite d’Ottawa, 1958. Price not 
marked. Paper. 

Dominion Bureau of Statistics, Research 
and Development Division, Special Proj- 
ects Section. Incomes, Liquid Assets and 
Indebtedness of Non-Farm Families in 
Canada 1955. Ottawa, Canada: Edmond 
Cloutier, 1958 (Canadian Government 
Printing Service)., $.75. Paper. 

Downie, N. M., and Heath, R. W. Basic 
Statistical Methods. New York: Harper 
and Brothers, 1959. $4.50. 

Duncan, Acheson J. Quality Control and In- 
dustrial Statistics. Homewood, Illinois: 
Richard D. Irwin, Inc, 1959. $9.00. 

East African Statistical Department. Bibli- 
ography of Economics in East Africa. 
Nairobi, Kenya Colony: East African 
Statistical Department, 1958. Price not 
listed, Paper. 

East African Statistical Department— 
Kenya Unit. KEN YA—Survey of In- 
dustrial Production, 1956. Kenya Colony, 
East Africa: East African Statistical De- 
an Kenya Unit, 1958. 7 shillings. 

‘aper. 

East African Statistical Department— 
Kenya Unit. Statistical Abstract—1958. 
Kenya Colony, East Africa: East African 
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Statistical Department, 1958. 12/50 
shillings. Paper. 

East African Statistical Department, Tan- 
— Unit. Report on the Census of the 

on-African Population Taken On the 
Night of 20th/21st February, 1957. Tan- 
anyika, East Africa: Government 
rinter, 1958. Sh. 25. Paper. 

East African Statistical Department, Tan- 
—_ Unit. Tanganyika Population 

ensus 1957: General African Census, 
August 1957, Tribal Analysis Part I; 
Territorial, Provincial, District. Nairabi, 
Kenya Colony, East Africa: East African 
Statistical Department, 1958. Sh. 5. 
Paper. 

East African Statistical Department, 
Uganda Unit. 1958 Statistical Abstract. 
Kenya Colony, East Africa: East African 
Statistical Department, Uganda Unit, 
1958. Price not marked. Paper. 

Eisenschitz, R. K. Siatistical Theory of Ir- 
reversible Processes. New York, New 
York: Oxford University Press, 1958. 
Paper. 

Enrick, Norbert L. Quality Control. New 
York, New York: The Industrial Press, 
1959. Price not marked. 

Epstein, Ralph C. Making Money in Today’s 
Market. New York, New York: Double- 
day and Co., 1959. $1.75. Paper. 

Federal Reserve System Committee. mT 
Federal Funds Market. Washington 
D.C.: Federal Reserve System, 159" 
$1.00. Paper. 

Fein, Rashi. Economics of Mental Illness, 
Monograph Series No. 2. New York, 
eB York: Basic Books, Ine., 1958. 

3.00 

Grabbe, Eugene M., Ramo, Simon, and 
Wooldridge, Dean E. (Editors) Hani- 
book of Automation, Computation, and 
Control— Volume I, Control Fundamen- 
tals. New York, New York: John Wiley 
and Sons, Inc., 1958. $17.00. 

Gumbel, E. J. Statistics of Extremes. New 
York, New York: Columbia University 
Press, 1958. $15.00. 

Gupta, H. C. Problems and Processes of 

conomic Planning in Underdeveloped 
Economies (With Special Reference to 
India). Allahabad, India: Kitab Mahal, 
1958. 5/Rs. 

Gupta, Rajnarain, Mathur, R. N. Devel- 
opment and Working of the Indian Con- 
stitution. Allahabad, “india: Kitab Mahal, 
1959. 750/Rs. 

Haggard, Ernest A. Jntraclass Correlation 
and the Analysis of Variance. New York, 
New York: Henry Holt and Co., 1958. 


$2.90. 

Hall, Marshall, Jr. The Theory of Groups. 
New York, New York: The Macmillan 
Company, 1959. $8.75. 

Harbury, C. D. The Industrial Efficiency of 
Rural Labour. Cardiff, Wales: University 
of Wales Press, 1958. Sh. 35. 

Haring, Albert, Yoder, Wallace O. (Edi- 
tors). Tradin; Stamp Practice and Pric- 
ing Policy. Bloomington, Indiana: Indi- 
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Bureau of Business Re- 
search, Schoo f Business, 1958. $6.00. 

Hauser, Philip and Duncan, Otis 
Dudley (Edivors). The Study of Popula- 
tion, An Inventory and Appraisal. 
Chicago, Illinois: University of Chicago 
Press, 1959. $15.00. 

Hill, Samuel E., and Harbison, Frederick. 
Manpower and Innovation in American 
Industry. Princeton, New Jersey: In- 
dustrial Relations ’Section, Princeton 
University, 1959. $2.00. Paper. 

Hoch, Irving. Forecasting Economic Ac- 
tivity for the Chicago Region: Final 
Report. Chicago, Illinois: Chicago Area 
Transportation Study, 1959. Price not 
marked. Paper. 

Holmes, Alan R. The New York Foreign 
Exchange Market. New York, New York: 
Federal Reserve Bank of New York, 
1959. $.50. Paper. 

International Labour Office. International 
Standard Classification of Occupations. 
Washington, D.C.: Intemational Labour 
Office, 1958. $3.50. Sh. 21. 

Jury, Eliahu I. Sampled-Data Control Sys- 
tems. New York, New York: John Wiley 
and Sons, Inc., 1958. $16.00. 

Kemeny, John G., Mirkil, Hazelton, Snell, 
. Laurie, Thompson, Gerald L. Finite 
fathematical Structures. New York, New 

York: Prentice-Hall, Inc., 1959. $7.95. 

Khare, G. P. Planning In India. Allahabad 
India: Kitab Mahal, 1958. Rs. /3.75. 

Kimmel, Lewis H. Federal Budget and 
Fiscal Policy 1789-1958. Washington, 
D.C.: The Brookings Institution, 1959. 


ana Universit 


$5.00. 
Kullback, Solomon. Information Theory 


and Statistics. New York, New York: 
John Wiley and Sons, Inec., 1959. $12.50. 
Langer, Rudolph E. (Editor). On Numerical 
Approximation. Proceedings of a Sym- 
posium Conducted by the Mathematics Re- 
search Center, U. S. Army ai the Uni- 
versity of Wisconsin, Madison, April 
21-23, 1958. Madison, Wisconsin: The 
University of Wisconsin Press, 1959. 
4.5 


Li, Ching Chun. Numbers from Experi- 
ments. Pittsburgh, Pennsylvania: The 
Boxwoad Press, 1959. Paper, $2.75. 
Cloth, $4.00. 

Lichtenberg, Robert M. The Role of Middle- 
man Transactions in World Trade. New 
York, New York: National Bureau of 
Economic Research, Inc., 1959. $1.50. 
Paper. 

Mamoria, C. B. Tribal Demography in 
India. Allahabad, India: Kital Mahal, 
1959. Rs/5. 

Markham, Jesse W. The Fertilizer Indus- 
try: Study of an Imperfect Market. 
Nashville, Tennessee: Vanderbilt Uni- 
versity Press, 1958. $6.00. 

Markowitz, Harry M. Portfolio Selection, 
Efficient Diversification of Investments, 
Monograph 16. New York, New York: 
John Wiley and Sons, Inc., 1959. $7.50. 

Melman, Seymour. Decision-Making and 
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Productivity. New York, New York: 
John Wiley and Sons, Inc., 1958. $7.00. 

Menges, Giinter. Stichproben Aus End- 
lichen Gesamtheiten; Theorie Und Tech- 
nik. Frankfurt, Germany: Frankfurter 
Wissenschaftliche Beitrige, 1959. dm. 
8.50. Paper. 

Meyer, John R., Peck, Merton J., Stena- 
son, John, Zwick, Charles. The Econom- 
ics of Competition in the Transportation 
Industries. Cambridge, Massachusetts: 
Harvard University Press, 1959. $7.50. 

Morgenstern, Oskar. /nternational Finan- 
cial Transactions and Business Cycles. 
Princeton, New Jersey: Princeton Uni- 
versity Press, 1959. $12.00. 

National Bureau of Economic Research, 
Inc. The Study of Economics Growth, 
Thirty-ninth Annual Report. New York, 
New York: National Bureau of Economic 
Research, Inc., 1959. Price not marked. 
Paper. 

Nelson, James C. Railroad Transportation 
and Public Policy. Washington, D.C. 
The Brookings, Institution, 1959. $7.50. 

Nelson, Richard Lawrence. The Selection of 
Retail Locations. New York, New York: 
F. W. Dodge Corporation, 1959. $9.00. 

Neudruck, Berichtigter. Formeln Und 

« Tabellen Der Mathematischen Statistik. 
Berlin, Gottingen, Heidelberg, Germany: 
Springer-Verlag, 1958. dm 12.60. 

Ohkawa, Kazushi, Umemura, M., Shino- 
hara, M., Ito, M., Noda, T. The Growth 
of the Japanese Economy. Tokyo, Japan: 
Kinokuntya Bookstore Co., Ltd , 1957. 
Price not marked. 

Pedoe, Dan. The Gentle Art of Mathematics. 
New York, New York: The Macmillan 
Company, 1959. $3.50. 

Pfanzagl, i Die Axiomatischen Grundlagen 
Einer Aligemeinen Theorie. Des Messens. 
Wurzburg, Germany: Physica-Verlag, 
1959. dm 14-. Paper. 

Roberts, David R. Executive Compensation. 
Glencoe, Illinois: The Free Press of 
Glencoe, 1959. $6.00. 

Salzer, H. E. (National Bureau of Stand- 
ards) Tables of Osculatory Interpolation 
Coefficients. Washington, D.C: National 
Bureau of Standards, 1959. $.30. Paper. 

Schlaifer, Robert. Probability and Statistics 
for Business Decisions; An Introduction to 
Managerial Economics Under Uncer- 
tainty. New York, New York: McGraw- 
Hill Book Co., 1959. $11.50. 

Shubik, Martin. Strategy and Market Struc- 
ture. New York, New York: John Wiley 
and Sons, Inc., 1959. $8.00. 

Sinha, Gorakh Nath. An Introduction to 
Food Economics. Allahadad, India: Kitab 
Mahal, 1959. Rs. 3/. 

Snyder, Eleanor M. Public Assistance Re- 
cipients in New York State January- 
February 1957: A Study of the Causes of 
Dependency During a Period of High- 
Level Employment. New York, New 
York: State of New York Interdepart- 
on Low Incomes, 
1958. Paper. 


Theil, H. Contributions to Economic Analy- 
sis: Economic Forecasts and Policy. 
Amsterdam, Netherlands: North-Hol- 
land Publishin Company, 1958. $13.35. 

Thurstone, L. 4 The easurement of 
Values. Chicago, Illinois: The University 
of Chicago Press, 1959. $7.50. 

Ulmer, Melville J. Economics, Theory and 
Practice. Boston, Massachusetts. Hough- 
ton Mifflin Company, 1959. $6.95. 

Umited Nations Department of Economic 
and Social Affairs. A Manual For Eco- 
nomic and Functional Classification of 
Government Transactions. New York, 
New York: Columbia University Press, 
1958. $2.00. Paper. 

United Nations Department of Economic 
and Social Affairs. Multilingual Demo- 
graphic Dictionary: English Section, Pop- 
ulation Studies No. 29. New York, New 
York: Columbia University Press, 1958. 
$.50. Paper. 

United Nations Economic Commission for 
Europe. Annual Bulletin of Housing and 
Building Statistics for HEurope—19657. 
New York, New York: Columbia Uni- 
versity Press, 1958. $.40. Paper. 

United Nations Economic Commission for 
Europe. Financing cd Housing in Europe. 
New York, New York: Columbia Uni- 
versity Press, 1958. $.80. Paper. 

United Nations Food and Agriculture Or- 
ganization, Forestry Division. Yearbook 
of Forest Products Statistics. New York, 
New York: Columbia University Press, 
1959. $2.50. Paper. 

United Nations Department of Economic 
and Social Affairs, Statistical Office. 
Statistical Yearbook 1958. New York, 
New York: Columbia University Press, 
1959. Cloth, $8.00, paper, $6.50. 

United Nations Department of Economic 
and Social Affairs. Yearbook of Inter- 
national Trade Statistics 1956, Volumes I 
and II. New York, New York: Columbia 
Press, 1957. Volume I, $7.00, 
Volume IJ, $1.50. Paper. 

United Nations International Labour Of- 
fice. Yearbook of Labour Statistics, 1958. 
Geneva, Switzerland: United Nations, 
1958. $5.00. Paper. 

United States Department of Commerce, 
Bureau of the Census. Location of Manu- 
facturing Plants by Industry, County, and 
Employment Size: 1954, Part 5—Chemi- 
cal and Products; Petroleum and Coal 
Products; Rubber Products. Washington, 
D.C.: Bureau of the Census, 1959. $.60. 
Paper. 

United States Department of Commerce, 
National Bureau of Standards. Tables of 
the Exponential Integral for Complex 
Arguments. Washington, D.C.: National 
Bureau of Standards, 1958. $4.50. 

United States Department of Health, Edu- 
cation, and Welfare, Office of Education. 
Current Expenditures Per Pupil in Public 
School Systems: Small and Medium-Sized 
Cities, 1966-57. Washington, D.C.: U.S. 
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Department of Health, Education, and 
Welfare, 1958. $.25. Paper. 

United States Department of Health, Edu- 
cation, and Welfare, Office of Education. 
Guide to the National aE Act of 1958. 
Washington, D.C.: Department of 
Health, Education, and Welfare, 1959. 
$.30. Paper. 

United States Department of Health, Edu- 
cation and Welfare. Statistics of Land- 
Grant Colleges and Universities. Wash- 
ington, D.C.: U. 8. Department of 
Health, Education, and Welfare, 1959. 
$.55. Paper. 

United States Department of Health, Edu- 
cation and Welfare. Statistics of State 
School Systems: 1955-56: Organization, 
Staff, Pupils Ch. 2. 
Washington, 8S. Department of 
Health, Welfare, 1959. 
$.45. Pap er. 

United States Department of Labour Sta- 
tistics. Frequenc of Change in Wholesale 
Prices—A Stu Flexibility. 
Washington, D. G4 S. Department of 
Labour, 1959. $.30. Pap er. 

United States Joint hneute Committee. 
Employment, Growth, and Price Levels, 
Part 1—The American Economy: Prob- 
lems and Prospects. Washington, D.C.: 
U. 8S. Joint Economic Committee, 1959. 
$.60. Paper. 

United States Joint Economic Committee. 
Hearings Before the Joint ron ie 
Committee. Washington, D.C.: 

Joint Economic Committee, 1959. $2. 25. 
Paper. 

United States Joint Economic Committee. 

Joint Economic Committee Report on the 
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January 1959 Economic Report of the 
President. Washington, D.C.: Joint Eco- 
nomic Committee, 1959. Price not 
marked. Paper. 

United States Joint Economic Committee. 
The Federal Revenue System: Facts and 
Problems 1959. Washington, D.C.: U. 8. 
— Economic Committee., 1959. $.65. 


ville He H. Economic Development. 
New York, New York: Rinehart ont Co., 
.» 1959. $2.50. Paper. 


° 
epartment of Health, Education 
and ‘Wel are). Participation in Adult 
Education. Washington, D.C.: U. 8. De- 
artment of Health, Education, and Wel- 
are, 1959. $.30. Paper. 
Warntz, William. Toward a Goomenhy, of 
Price. Philadelphia, Pennsylvani ni- 
versity of Pennsylvania Press, 1959. 


$5.00. 

Whelpton, P. K., Kiser, Clyde V. (Editors) 
Social and Psychological Factors Affecting 
Fertility— Volume 5. New York, New 
York: Milbank Memorial Fund, 1958. 
$1.00. Pap 7 

Vogt, and Hyman, Ray. Water 

ttching U.S.A. Chicago, Illinois: Uni- 
versity of Chicago Press, 1959. $4.95. 

Wessel, Robert H., illett, Edward R. 
Statistics as Applied to Economics and 
Business. New York, New York: Henry 
Holt and Co., 1959. $5.00. 

Williams, Ernest W., Jr. Freight Trans- 
portation in the Soviet Union: A Compari- 
son with the United States. New York, - 
New York: National Bureau of Economic 
Research, x 1959. $.75. Paper. 


EDITORIAL COLLABORATORS—Continued 


Dinis Sarcon, University of Minnesota 

IRVING ScHWEIGER, University of Chicago 

Senvers, University of Minne- 
sota 

Henry 8. Suryock, Bureau of the Census 

SITGREAVES, Columbia Univer- 
sity 

J. Laurte SNELL, Stanford University 

Rosert Sotow, Massachusetis Institute of 
Technology 

MortTIMER SPreGELMAN, Metropolitan Life 
Insurance Company 

James H. Stepieton, Michigan State Uni- 
versity 

Georce J. Stricter, University of Chicago 

WituramM F. University of Cali- 
fornia, Berkeley 

I. THayer, Washington State Uni- 
versity 

Harotp A. Tuomas, Jr., Harvard Univer- 
sity 


J. Tuompson, University of Pitts- 

ur 

Desens Truax, University of Kansas 

MELVILLE J. ULMER, American University 
Netherlands 

J. R. Vatrnspat, Washington State Univer- 
sity 

Jacos VINER, Princeton University 

Freprerick V. WauaGu, Bureau of Agricul- 
tural Economics 

Joun 8S. Wuire, Minneapolis-Honeywell 
Regulator Company 

D. Ransom Wuitney, Ohio State University 

H. Wop, Uppsala University, Sweden 

Max Woopsury, University of Pennsyl- 
vania 

CuarLes Michigan State Uni- 
versity 

N. Donatp YLvISsAKER, Stanford University 

8. 8. ZarKovic, United Nations 

ZELEN, National Bureau of Stand- 
ar 


< 

: 

| 
: 


AN INVITATION 
TO JOIN ORO 


Pioneer In Operations Research 


Operations Research is a young science, earning 
recognition rapidly as a significant aid to decision- 
making. It employs the services of mathematicians, 
physicists, economists, engineers, political scientists, 
psychologists, and others working on teams to syn- 
thesize all phases of a problem. 


At ORO, a civilian and non-governmental organ- 
ization, you will become one of a team assigned to 
vital military problems in the area of tactics, strategy, 
logistics, weapons systems analysis and communi- 
cations. 


No other Operations Research organization has 
the broad experience of ORO. Founded in 1948 by 
Dr. Ellis A. Johnson, pioneer of U. S. Opsearch, 
ORO’s research findings have influenced decision- 
making on the highest military levels. 


ORO’s professional atmosphere encourages those 
with initiative and imagination to broaden their 
scientific capabilities. 

ORO starting salaries are competitive with those 
of industry and other private research organizations. 
Promotions are based solely on merit. The “fringe” 
benefits offered are ahead of those given by many 
companies. 


The cultural and historical features which attract 
visitors to Washington, D. C. are but a short drive 
from the pleasant Bethesda suburb in which ORO is 
located. Attractive homes and apartments are within 
walking distance and readily available in all price 
ranges. Schools are excellent. 


For further information write: 


OPERATIONS RESEARCH OFFICE 


The Johns Hopkins University 


6936 ARLINGTON ROAD 
BETHESDA 14, MARYLAND 
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New WMcGRAW-HILL Books 


A PRIMER OF PROGRAMMING 
FOR DIGITAL COMPUTERS 
By Marshal H. Wrubel, Indiana University. 230 pages, $7.50. 


An introductory text, designed for junior-senior courses for physical scientists, engineers, 
and all other students who have problems to solve on computers. The purpose of the book 
is to explain how to go about setting up a problem for a digital computer, how to test it, 
and how to make it available to others, The primer discusses procedures common to 

digital electronic machines, but an actual machine—the IBM Type 650—is used for 


examples. 


PROBABILITY AND STATISTICS FOR 


BUSINESS DECISIONS 


An Introduction to Managerial Economics Under Uncertainty 
By Robert Schlaifer, Harvard University. 732 pages, $11.50 


A nonmathematical introduction to the logical analysis of practical business problems in 
which a decision must be reached under uncertainty. The analysis which it recommends is 
based on the modern theory of utility and what has come to be known as the “personal” 
definition of probability. Exercises are provided at the end of each chapter. 


HIGH-SPEED DATA 
PROCESSING 


By C. C. GOTLIEB and J. N. P. HUME, both 
of the University of Toronto. McGraw-Hill 
Series in Information Processing and Com- 
puters. 338 pages, $9.50 


A basic, comprehensive treatment of the im- 
portant principles and general techniques of 
processing data at high speeds. It shows how 
data processors work, how to use them, and 
what their advantages are. Coding and pro- 
gramming methods are included and ex- 
amples of typical applications of high-speed 
data processing are shown. 


APPLIED STATISTICS 
FOR ENGINEERS 


By WILLIAM VOLK, Hydrocarbon Research, 
Inc., Princeton, N.J. McGraw-Hill Series in 
Chemical Engineering. 354 pages, $9.50 


Provides enough background and examples 
of statistical problems to enable practicing 
engineers to apply statistical analysis to 
their data. Emphasizing applications and 
providing many illustrative examples, it 
deals with the treatment of engineering data 
for correlation, precision, and analysis of 
experimental factors. Each chapter is com- 
plete in itself, 


Send for copies on approval 


McGRAW-HILL BOOK COMPANY, INC. 


330 WEST 42no STREET, NEW YORK 36, N.Y. 
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The Economies of Competition in the 
Transportation Industries 


By JOHN R. MEYER, MERTON J. PECK, JOHN STENASON, CHARLES 
ZWICK. The first full-scale statistical analysis of the relative costs of each 
type of American carrier. The authors meet the perplexing problems of eco- 
nomically efficient traffic allocation, and public regulatory ee to achieve 
this effect, with important statistical answers. A major study for transporta- 
tion specialists and economists. $7.50 


Determinants of Consumer Demand for House 
Furnishings and Equipment 
By VERNON G. LIPPITT. Enploys a method both novel and powerful in 


ae ag the results of an investigation into the factors which influence the 
uying of house furnishings. Cross-sectional and time-series data are com- 
bined to assure complete and extensive coverage. Dr. Lippitt’s meticulous 


analysis is capable of extension to other consumer categories, and is in itself 
a solid contribution to the field of economic forecasting. $6: 


Through your bookseller, or from 
HARVARD UNIVERSITY PRESS 
79 Garden Street, Cambridge 38, Massachusetts 


As a long term prime contractor for the 
AEC we design, develop and manufacture 
extremely precise, complex electronic and 
electro-mechanical devices. Our program en- 
ables us to offer these two excellent “ground 
floor” opportunities. They afford many pro- 
fessional and personal advantages well 
worth your immediate investigation. Kansas 
City is a beautiful and progressive city, fea- 
turing convenience, moderate climate, mod- 
ern, uncrowded and highly-rated schools, 
plus many recreational and cultural activi- 
ties. Assistance program for advanced study 
at local universities if desired. 


STATISTICAL ANALYST 


. . . familiar with the underlying concept 
(and mathematics) of process quality con- 
trol, sampling procedures, and experimental 
design and analysis. Several years experi- 
ence in industry together with a degree in 
statistics or a degree in mathematics or en- 
gineering with several courses in statistics, 
is required. This position requires close co- 
ordination with systems engineers in setting 


@ STATISTICAL ANALYST 
@ MATHEMATICAL STATISTICIAN 


up statistical quality control programs to 
insure reliability of the end product. 


MATHEMATICAL STATISTICIAN 

. . with an advanced degree and familiar 
with nonparametric and order statistics, 
Monte Carlo procedures and Markoff proc- 
esses. Experience in implementing these 
methods is desirable, as is some familiarity 
with computers, particularly the IBM 650. 
Your ability to recognize and formulate 
problems is of paramount importance. 
Unique contribution in the fields of opera- 
tions research, reliability evaluation and pre- 
diction, and statistical decision theory are 
expected. 


Mail brief, confidential résumé to: 
Mr. T. H. TILLMAN 
Box 303-KG. Bendix, Kansas City, Mo. 


KANSAS CITY, MISSOURI 
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Ready November, 1959 
INTRODUCTION TO MATHEMATICAL STATISTICS 


ROBERT V. HOGG, Associate Professor of Mathematics 
ALLEN T. CRAIG, Professor of Mathematics, 
both, University of Iowa 


The first publication in a series 
of mathematics texts under the general editorship of 
CARL B, ALLENDOERFER. 


This work for advanced students in mathematics, tested in preliminary 
form in classes at the University of Iowa, introduces in a reasonable and 
natural order fundamental concepts in probability and statistics and 
shows, whenever possible, among these concepts. Topics 
treated include random sampling distribution theory, point and in- 
terval estimation, limiting distributions, distribution free problems and 
tests of statistical hypotheses. Considerable emphasis is placed on the im- 
portance of the concept of a sufficient statistic. Especially noteworthy is 
the inclusion of modern theoretical developments not found in current 
books at this mathematical level. 


Published 


INTRODUCTION TO PROBABILITY AND STATISTICS 

B. W. LINDGREN, Assistant Professor of Mathematics 

G. W. McELRATH, Professor and Head of the Industrial 
Engineering Division, both, University of Minnesota 

This introductory textbook presents classical and modern statistical meth- 
ods based on a preliminary, thorough treatment of the concept of proba- 
bility. Although some background in calculus is presupposed, the work 
is not based on a pure mathematical approach to statistics. Professor Paul 
Randolph, Industrial Engineering Division, Purdue University writes: 
“Chapters 1-5 make up one of the neatest packages on probability basic 
to statistics that I have ever seen. . .. The textual development is orderly, 
the examples good, and the problems superb!” 1959, 277 pages, $6.25 


ELEMENTARY MATRIX ALGEBRA 

FRANZ E. HOHN, Associate Professor of Mathematics, 
University of Illinois 

“. .. in a rather long mathematical career I have seen only a very few 
mathematical books of such excellence in every respect. . . . I can only 
wish there were more mathematical books like this one. . . .”—John P. 
Scholz, Professor of Mathematics, New Mexico Institute of Mining and 
Technology 1958, 305 pages, $7.50 (Text edition) 


The Macmillan Company 


60 FIFTH AVENUE, NEW YORK 11, N.Y. 
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Mathematical 
Statisticians 


& Unusual opportunities exist at Bat- 
telle Memorial Institute for advanced- 
degree level statisticians. Battelle’s 
staff of over 2200 conducts research 
in practically all fields of industrial 
science and in many specialized tech- 
nologies. 


» Challenging research and develop- 
ment assignments on problems for in- 
dustry and government, for statisti- 
cians sufficiently trained in mathe- 
matics and statistics to plan and ana- 
lyze research projects in the engineer- 
ing and physical sciences. 


For the professional who seeks 
more than just a “job,” staff assign- 


ments at Battelle provide the oppor- 
tunity for: 


© Project assignments that recognize the in- 
dividual’s choice and allows freedom of ex- 
pression approaching that offered in aca- 
demic positions, 


© Diversified research assignments making 
tangible contributions to industrial tech- 
nology and national security. 


Continued education at neighboring Ohio 
State University. 


Preparation and presentation of technical 
papers and activity in professional groups. 
Desirable living conditions in a community 
area having broad social and recreational 
activities. 


@ Salaries which are competitive with those 
paid by other organizations for engineers 
and scientists. 


All inquiries, which will be treated in confi- 
dence, should be directed to: 


L. G. Hill, Supervisor of Employment 


Battelle Memorial Institute 


505 King Avenue 
Columbus |, Ohio 


BUSINESS STATISTICS 


By Dr. John R. Stockton 
The University of Texas 


Here is a skillfully written book that stresses the value of statistical analy- 
sis in the solution of business problems. The mathematics needed by the 
student is explained as the principles of analysis and the formulas are 
presented. Many charts are used to illustrate the various uses of the 
graphic method. 


The problem material in BUSINESS STATISTICS is outstanding, both 
in quality and quantity. A workbook, containing twenty long problems, 
is available. 


SOUTH-WESTERN PUBLISHING CO. 


(Specialists in Business and Economic Education) 


Cincinnati 27 New Rochelle, N.Y. Chicago 5 San Francisco 3 Dallas 2 
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A masterly and creative synthesis 


A GENERAL THEORY OF THE PRICE 
LEVEL, OUTPUT, INCOME DISTRIBU- 


TION AND ECONOMIC GROWTH 


By Professor Sidney Weintraub, 

University of Pennsylvania 
Already hailed as one of the most significant contributions in the field of 
economics since Keynes, this volume presents an empirically oriented 
approach which leads to accurate predictions equivalent to the work in 
the physical sciences. It will affect institutional planning and economic 
policies through its new perspectives and insights and will be of para- 
mount importance to all statisticians. $3.50 


Just Published 


HANDBOOK OF AREA SAMPLING 
By John Monroe, University of North Carolina and 
A. L. Finkner, North Carolina State College. 
56 pages, paper illustrated—$3.00 
Send For Free Examination Copy 
COLLEGE DEPARTMENT 
CHILTON COMPANY—Book Division Philadelphia 39, Pa. 


Mathematical Statisticians 


Exceptional opportunities exist at the 
Naval Weapons Laboratory for mathe- 
matical statisticians with MS and PhD 
degrees and an interest in operations re- 
search. The principal efforts of the Op- 
erations Research Group at presen! are 
devoted to the formulation and execution 
of extensive programs in the areas of 
target analysis, weapons system analysis, 
and missile feasibility and evaluation. 
Senior Statisticians on the staff also 


serve as consultants in areas of statistical 
inference, probability, and experimental 
design. The most advanced computing 
equipment and capable junior scientists 
are available for assistance. Starting 
salaries range from $7,510 to $11,595 per 
annum. The Naval Weapons Laboratory 
provides an excellent work atmosphere 
and, in addition, the advantages of living 
in a pleasant small community with eco- 
nomical housing and many recreational 
facilities. 


For further infomation, write to the Director, 
Computation and Analysis Laboratory. 


NWL 


U. S. Naval Weapons Laboratory 
Department of the Navy Dahlgren, Virginia 
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COMPUTING 


SERVICE 


..»Made to Order 
For Researchers 
and Statisticians 


Since few companies have 
enough work volume to jus- 
tify computers of their own, 
STATISTICAL maintains com- 
puting equipment to serve any 
company on a low-cost, hourly, 
as-needed basis. 


This service is built around 
the combined skills of mathe- 
maticians, statisticians, project 
engineers and programming 
specialists—ready to work for 
you on your computing prob- 
lem. 


Here are a few of the appli- 
cations in which our computer 


STATISTICAL 
TABULATING CORPORATION 


Established 1933 


CALCULATING 
TEMPORARY OFFICE PERSONNEL 


TABULATING - 


TYPING 


will excel in saving you time 
and money: 


» Simple and Multiple Cor- 
relations and Regressions 


e Analysis of Variance 
e Factor Analysis 


e Chi Square For A 
Contingency Table 


e Matrix Calculations 
e Linear Programming 
e Curve and Surface Fitting 


e Solution of 
Differential Equations 


Just contact our nearest office 
today for a free analysis and 
cost estimate of your problem. 


GENERAL OFFICES: 


53 West Jackson Boulevard 
Chicago 4, Illinois 
Phone: HArrison 7-4500 


Chicago * New York * St. Louis 
Newark * Cleveland 

Los Angeles * Kansas City 
San Francisco * Milwaukee 
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Just Out eee 


COLLECTING FINANCIAL DATA BY CONSUMER 
PANEL TECHNIQUES: A CASE STUDY 


by ROBERT FERBER 


This is the first of a series of research monographs reporting results of 
studies carried out by the Inter-University Committee for Research on Con- 
sumer Behavior with financial assistance from the Ford Foundation. 


The present monograph summarizes the results obtained from a pilot con- 
sumer panel operation on the savings habits and attitudes of Chicago families. 
Methods and procedures of conducting this study are discussed in detail, with 
particular reference to the collection of asset and debt information. 

About 180 pages, $2.50 


BUREAU OF ECONOMIC AND BUSINESS RESEARCH 
UNIVERSITY OF ILLINOIS 


leox 658, STATION A 
CHAMPAIGN, ILLINOIS 


VITRO EXPANDS 


Increased R&D activities at Vitro Laboratories 
at West Orange, New Jersey, have created oppor- 
tunities for engineers and scientists 

in diversified projects. 


MATHEMATICAL STATISTICIANS 


Advanced navigational systems 
Information Analysis 
Reliability 


OPERATIONS RESEARCH ANALYSTS 


Complex Military Systems requiring familiarity 
with military tactics and logistics. 


Liberal benefits include a tuition refund plan and reloca- 
tion allowances. All inquiries in confidence. Send réswmé 
including salary requirements to Mr. 8. Roberts. 


Vitr (tf LABORATORIES 


Division of Vitro Corp. of America 
200 Pleasant Valley Way, West Orange, New Jersey 
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M. S. 
_EXPERIMENTAL 
STATISTICIAN 


B.S. IN MATHEMATICS 
OR PHYSICAL SCIENCE 


| to 5 years experience 
TO CONSULT WITH 


RESEARCH AND DEVELOPMENT 
ON EXPERIMENTAL DESIGN 


Experience with surface fitting concepts is necessary 
Direct access to IBM-650 


Reply to: 


MR. B. R. DiCAPRIO 
AMERICAN CYANAMID COMPANY 
ORGANIC CHEMICALS DIVISION 
BOUND BROOK, NEW JERSEY 


REVIEW OF ECONOMICS AND STATISTICS 
Published by the Department of Economics, Harvard University 
Editor: SEYMOUR E. HARRIS* 


Associate Editors: Abram Bergson, Edward H. Chamberlin, Robert Dorfman, James S. Duesen- 
berry, John T. Dunlop, John Kenneth Galbraith,* Al k Gottfried Haberler,* 
Alvin H. Hansen,* Carl Kaysen, Wassily Leontief,* John Sone Edward S. Mason, Sumner 
H. Slichter, Arthur Smithies, John H. Williams. 

* Members of Managing Board. 


Volume XLI August 1959 Number 3 


For the Aband of Game T. C. Schelling 
Inflation and Quasi- Elective Changes in Costs 
Expenditure Implications of Metropolitan Growth and Consolidation 
The Utilization of Agricultural Land: A Theoretical and Empirical Inquiry 
James M. Henderson 


E. Cary Brown and Richard J. Kruizenga 
On Concepts and Measures of Changes in Productivity Luigi L. Pasinetti 
Comment .......+++ Robert M. Solow 
Reply Luigi L. Pasinetti 
The Relationship between Tangible et E. Scott Maynes 
A Theory of Speculation Relating at oan oy pots Stability Lester G. Telser 
Reply W. J. Baumol 
A Note on Interest Rates and the Demand for Money Andrew C. Stedry 
NOTES AND BOOK REYIEWS 


The Review of E jes and Statistics is published quarterly. 
Annual subscription price: $7.00. 


Order from the Harvard University Press, 
Cambridge 38, Mass. 
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THE JOURNAL OF FINANCE 


The Journal of THE AMERICAN FINANCE ASSOCIATION 


Vol. XIV September 1959 No. 3 
The Misconceived Problem of International Liquidity ................++.. Leland B. Yeager 
An Empirical Study of Abandonment Decisions ...............000+seeeeeees Jobn F. Due 
Price Changes of Stock-Dividend Shares at Ex-Dividend Dates .......... C. Austin Barker 
The Effects of Installment Credit Term Variation ............--2-00+005 Lawrence Werboff 
Shifting of the Corporate Income Tax: A Dynamic Analysis ..............++.. Don Soule 
COMMUNICATIONS 
A Note on Bank Earnings and Savings 


avid A. “Alba and Charlotte P. Albadeff 
A Note on the Small Mortgage Market .. «seeee-James Gillies and Clayton Curtis 
A Note on Monetary Controls and Capital “Valees C. A. Matthews 


Membership dues, including $3.00 allocated to subscription in The Journal of Finance, are 
$5.00 annually. Libraries may subscribe to The Journal at $5.00 annually and single copies 
may be purchased for $1.25. Applications for “shembership in the American Finance Associ- 
ation and subscriptions to The Journal of Finance should be addressed to the Secretary- 
Treasurer, George E. Hassett, School of Business New York Uni- 
versity, 90 Trinity Place, New "York 6 


Communications relating to the contents of The Journal of Finance should be addressed to 
the Editor, Joel Segall, Graduate School of Business, University of Chicago, Chicago 37, 
Illinois, or to the Associate eo John G. Gurley, The Brookings Institution, 722 Jackson 
Place, N.W., Washington 6, D 


AMERICAN ECONOMIC REVIEW 


VoL_uME XLIX SEPTEMBER 1959 NuMBER 4 
ARTICLES 
Diffusion, Acceleration, and Business Cycles ........00ccececccscccececes G. Hickman 
Utility Theory and Profit Maximization ..,............... J. E. Haring - G. © Smith 
Payments for Labor and Foreign Trade ............+.++ F. M. Williams and E. Eaton 
European Integration and American Trade ............0cececccceeceeceees M. E. Kreinin 
COMMUNICATIONS 
Price-Making in Forest Service Timber Sales ............00.ceeeeeees Sidney Weintraub 
The Cost of Capital and the Theory of Investment: 

Franco Modigliani and M. H. Miller 

Pricing in Companion: 
Comment . ..M. A. 
Industrial Growth in the Soviet Unica: 
R. V. Greenslade and P. A. Wallace 


The AMERICAN ECONOMIC REVIEW, a quarterly, is the official publication of the 
American Economic Association and is sent to all members. The annual dues are six dollars. 
Address editorial communications to Dr. Bernard F. Haley, Editor, AMERICAN ECONOMIC 
REVIEW, Stanford University, Stanford, California. For information concerning other 
publications and activities of the Association, communicate with the Secretary- Treasurer, 
Dr. James Washington Bell, American Economic Association, Northwestern University, Evans- 
ton, Wlinois. Send for information booklet. 
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STATISTICIAN 


Ph.D.’s in Applied or Mathematical Statistics with an interest in 
operations research. Immediate openings at the intermediate and 
senior levels with the Washington Office of an expanding scien- 
tific firm, with a challenging research program involving experi- 
mental design, mathematical models, and program evaluation. Ex- 
perience required. Apply to G. Ronald Herd, Director. 


BOOZ - ALLEN APPLIED RESEARCH, INC. 
4921 Auburn Avenue 
Bethesda 14, Maryland 


ECONOMETRICA 


Journal of the Econometric Society 
July, 1959 (Volume 27, Number 3) 


CONTENTS 
FRANKLIN M. FISHER: Generalization of the Rank and Order Conditions for Identifiability 
W. H. GORMAN: Separable Utility Functions in the Aggregation Problem 
W. H. GORMAN: The Empirical Implications of a Utility Tree: A Comment 
T. P. HILL: An Analysis of the Distribution of Wages and Salaries in Great Britain 
THOMAS MARSCHAK: Centralization and Decentralization in Economic Organizations 
ANDRE NATAF AND C. FOURGEAUD: Consommation en Prix et Revenu Reels et Théorie des 
oix 
JOAN ROBINSON: Letter to the Editor 
ROBERT H. STROTZ: The Utility Tree: A Correction and Further Appraisal 
HIROFUMI UZAWA: The Prices of Factors of Production in International Trade 
PHILIP WOLFE: The Simplex Method for Quadratic Programming 
REPORT OF THE CAMBRIDGE AND CHICAGO MEETINGS 
BOOK REVIEWS 
ANNOUNCEMENTS AND NOTES 


THE ECONOMETRIC SOCIETY 
YALE UNIVERSITY 
NEW HAVEN, CONN. 
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JOURNAL OF BUSINESS 


School of Business, University of Chicago, Chicago 37, Illinois 


VoLuME XXXII July 1959 NUMBER 3 


A Review of Some Economic Forecasts for 1955-1957 Arthur M. Okun 


The Federal Reserve Board Report on Smali Business Financing 
Edwin B. George and Robert ]. Landry 


A New Concept of Performance Appraisal Kenneth E. Richards 


The European Common Market and American Foreign Trade and Investment 
Stanislaw W ellisz 


Seasonal Variation and Business Expectations Donald J. Daly 
On the Efficiency of the Coal Industry Marc Nerlove 
Discussions and Notes Book Reviews 
Books Received Notes: University Schools of Business 


The JOURNAL OF BUSINESS is published quarterly by the Graduate School of Business 
of the University of Chicago. Subscriptions ate $6.00 per year and should be addressed to 
the JOURNAL OF BUSINESS, Graduate School of Business, University of Chicago. Editorial 
correspondence (manuscripts in duplicate) should be addressed to Irving Schweiger, Editor, 
JOURNAL OF BUSINESS at the same address. 


ESTADISTICA 


Journal of the 
Inter American Statistical Institute 
Vor. XVII, No. 62 Marcu 1959 
CONTENTS 
Address to the VI Session of the C i on Imp of National Statistics (COINS)— 
Discurso en la VI Sesién de la Comisié Mej i de las Estadisticas Nacionales 
El método estadistico y la filosofia de la ciencia Goateetind Seccccessoccecoesess Harold Hotelling 
La medida de la distribucién de la poblacién (t DAP scssann<sdsoccesvnd Otis Dudley Duncan 
Calculos de inversiones en paises Un Juicio critico (traduccién) 
La férmula de un nuevo niimero indice (traduccion) Stuvel 
Estudio de las relaciones interindustriales para 1947—Parte Il (traduccién) .........+++se00 
CeCeseerscccecorccsoosceceeesoceccooseccecoscoescececos W. Duane Evans y Marvin Hoffenberg 
La estadisti pidemiolégica en Dario Curiel 
Special Bestuses Comisién de Estadistica de las Naciones Unidas: Informe sobre el X Periodo 
de Sesiones, 28 de abril-15 de mayo de 1958 
Legal Provisions. | ional Resoluti Relating to Statistics. Institute Affairs. Statistical 
News. Publications. 


Published quarterly Annual subscription price $3.00 (U.S.) 


Inter American Statistical Institute 
Pan American Union, Washington 6, D.C. 
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The Annals of Mathematical Statistics 
THE OFFICIAL JOURNAL OF THE INSTITUTE OF 
MATHEMATICAL STATISTICS 


Vol. 30, No. 3—September, 1959 
Contents 
Estimating the Parameters of a Differential Process. Herman Rubin and Howard G. Tucker 
A Problem in Optimum Filtering with Finite Data ...... piiiseabeaihie Gopinath Kallianpur 
On Asymptotic Distributions of Estimates of Parameters of Stochastic Difference 
On the Identifiability Problem for Functions of Finite Markov Chains ..Edgar J. Gilbert 


Imbedded Markov Chain Analysis of a Waiting-Line Process in Continuous Time 
Donald P. Gaver, Jr. 


Asymptotic Expansions in Global Central Limit Theorems ...... . Ralph Palmer Agnew 


On the Existence of a Best Approximation of One Distribution Function by Another 
of a Given Type D. L. Burkbolder 


Bounds on the Expectation of a Convex Function of a Multivariate Random Variable 


The Number of Components in Random Linear Graphs 
T. L. Austin, R. E. Fagen, W. F. Penney, Jobn Riordan 


Sequential Design of Experiments Herman Chernoff 


Complex Representation in the Construction of Rotatable Designs ............ 


The Uniqueness of the Lz Association Scheme ......-...-.seeeeeseeees S. S. Shrikbande 


Combining Inter-Block and Intra-Block Information in Balanced Incomplete Blocks 
Franklin A. Graybill and David L. Weeks 


Asymptotically Efficient Tests Based on the Sums of Observations Jobn H. MacKay 


Linear Estimation in Censored Samples from Multivariate Normal Populations .... 
A. Watterson 


The Expression of k-Statistic ki: in Terms of Power Sums and Sample Moments .... 
M. Ziaud-Din 


Howard G. Tucker 


The Limiting Distribution of the Serial Correlation Coefficient in the Explosive Case 
ll Jobn S. White 


Abstracts of Papers 
Publications Received 


Address orders for subscriptions and back numbers to Professor George E. Nichol- 
son, Jr., Secretary, Institute of Mathematical Statistics, Department of Statistics, 
University of North Carolina, Chapel Hill, North Carolina. 
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TECHNOMETRICS 


A Journal of Statistics for the Physical, 
Chemical and Engineering Sciences 


Vol. 1, No. 2 CONTENTS May 1959 
Measurements Made by Matching with Known Standards ........... 
W. J. Youden, W. S. Connor and N. C. Severo 
Random Balance Experimentation ................. F. E. Satterthwaite 
The Application of Random Balance Designs ........ Thomas A. Budne 


Discussion of the Papers of Messrs. Satterthwait: and Budne ........ 
..W. J. Youden, O. Kempthorne, J. W. Tukey, G. E. P. Box, J. S. Hunter 


Quick Analysis Methods for Random Balance Screening Experiments. . 


Vol. 1, No. 3 CONTENTS August 1959 


Simplified Estimators for the Normal Distribution when Samples are 
Singly Censored or Truncated ................ A. Clifford Cohen, Jr. 


Control Chart Tests Based on Geometric Moving Averages .......... 


Factorial Experiments in Life Testing .................4. Marvin Zelen 

The Use of LaGrange Multipliers with Response Surfaces .......... 
de W. Umland and W. N. Smith 


A Statistical Model for Evaluating the Reliability of Safety Systems for 
Plant Manufacturing Hazardous Products ............ Louis B. Kahn 


Subscription rates: $6.00 per year for members of The American Statis- 
tical Association and the American Society for Quality Control. $8.00 
for non-members. Make remittance payable to Technometrics and send 
to: American Statistical Association, 1757 K St., N.W., Washington 6, 
D.C. 


Please mention the Journal of the AMeRicaNn SratisTicaL Association in writing advertisers 


ie: 
of 


BIOMETRICS 


Journai of the Biometric Society 


Vol. 15, No. 3 CONTENTS September 1959 | 
Biometric Method, Past, Present, and Future J. O. Irwin 
Some Recent Results in Chi-Square Goodness-of-Fit Tests G. S. Watson 
The Sampling Variance of the Genetic Correlation Coefficient Alan Robertson 


Confidence Limits for the LDs. Using the Moving Average-Angle Method 
Eugene K. Harris 


The Use of the Power Function to Determine an Adequate Number of Progeny per 


Sire in a Genetic Experiment Involving Half-Sibs Stanley Wearden 
Contribution to the Study of Grouped Observations. IV. Some Comments on Simple 

Estimates N. F. Gjeddebaek 
Optimum Group Size in Half-Sib Family Selection J. M. Rendel 
Pair Comparison, With and Without Ties N. T. Gridgeman 
On the Development of Clinical Statistical Systems for Psychiatry J. B. Chassan 


A Method for Testing Treatment Effects in the Presence of Learning Seymour Geisser 
The Comparison of the Sensitivities of Similar Experiments: Model II of the Analy- 

sis of Variance D. E. W. Schumann and R. A. Bradley 
Queries and Notes 


Test of Difference Between Treatment and Control with Multiple Replications 
of Control and a Missing Plot David Hogben 


Biometrics is published quarterly. Its objects are to describe and exemplify the use of 
mathematical and statistical methods in biological and related sciences, in a form as- 
similable by experimenters. The annual non-member subscription rate is $7.00. In- 
quiries, orders for back issues, and non-member subscriptions should be addressed to: 
BIOMETRICS, Department of Statistics, Virginia Polytechnic Institute, Blacksburg, 
Virginia. 


BIOMETRICS 


Department of Statistics 
Virginia Polytechnic Institute 
Blacksburg, Virginia 
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CHAPTER PRESIDENTS 


Herbert L. Bryan, Director of Research and Stat., New York State 
Dept. of Correction, 39 Columbia Street, Albany, New York 

N. K. Woerner, Dept. of Public Safety, North Austin Station, 
Austin, Texas 

Frederick Mosteller, Dept. of Statistics, Harvard University, 2 
Divinity Ave., Rm. 311, Cambridge 38, Massachusetts 


Richard N. Schmidt, School of Business Adm., University of 


Buffalo, Buffalo, N. Y. 

Charles R. Hicks, Dept. of Mathematics, Purdue University, 
Lafayette, Indiana 

Robert J. Buehler, Statistical Laboratory, Iowa State College, Ames, 
Towa 

Mrs. Gladys W. Ellsworth, Research & Statistics, N. J. State Dept. 
Cons. & Economic Development, 520 E. State Street, Trenton 7, 
New Jersey 

A. Arthur Charous, Sears Roebuck & Co. Dept. 733C, General 
Merchandise Office, Chicago, Illinois . 

Gregory P. Williams, 2725 Stratford Ave., Cincinnati 20, Ohio 

Richard H. Shaw, Mallory Battery Company, 13000 Athens Ave., 
Lakewood 7, Ohio 

Dennis Stump, Jr., 1205 S. Harrison Street, Denver 10, Colorado 

Robert B. Miner, Dept. of Business Organization, Ohio State Uni- 
versity, Columbus, Ohio = 

Robert R. Kirkland, The Southern New England Telephone Co., 
227 Church Street, New Haven 6, Connecticut 

John E. Condon, 2813 Nacoma Place, Dayton 10, Ohio 

Hugh R. Brown, Business Research Staff, General Motors Corp, 
11-114 G. M. Bldg., Detroit 2, Michigan 

Gordon Frazier, Bank of Hawaii, Bishop & King Streets, Honolulu 
13, Hawaii 

John A. Henry, Dept. of Mechanical Engineering, University of 
Illinois, Urbana, Illinois 

C. R. Henderson, Department of Husbandry, Cornell University, 
Ithaca, New York 

James B. Schultz, 1112 East Howard Avenue, Milwaukee 7, Wis- 
consin 

Roger Lessard, Ecole Polytechnique, 1430 St. Denis, Montreal, Que., 
Canada 

Bernard Harris, Dept. of Mathematics, University of Nebraska, 
Lincoln 8, Nebraska 

Roland Pertuit, 4871 Metropolitan Drive, New Orleans, Louisiana 

Robert E. Lewis, Economics Department, First National City 
Bank of N. Y., New York 15, N. Y. 

Gertrude M. Cox, Institute of Statistics, Box 5457, State College 
Station, Raleigh, North Carolina 

Paul D. Minton, 7317 Wild Valley Drive, Dallas 31, Texas 

William R, Wells, 4401 N.W. 50, Oklahoma City, Oklahoma 

Benjamin J. Tepping, National Analysts, Inc., 1015 Chestnut 
St., Philadelphia 7, Pennsylvania 

C. C. Li, School of Public Health, University of Pittsburgh, Pitts- 
burgh 13, Pennsylvania 

Alvin Mayne, 2169 Calle Gen Del Valle, Santurce, Puerto Rico 

Lionel W. McKenzie, Chairman, Dept. of Economics, University of 

Rochester, Rochester, New York 

Carl M. Frisén, 1570 Castec Drive, Sacramento 21, California 

Charles Roumasset, Bureau of Labor Statistics, Rm. 802, 630 San- 
some Street, San Francisco 11, California 

Alvord L. Boeck, Kwickset Locks., Inc., 516 E. Santa Ana St., 
Anaheim, California 

William 8S. Ray, Associate Prof. of Psychology, Pennsylvania State 
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